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This  handbook  ia  tha  first  edition  of  a  supplemental  text  to  support 
the  curriculum  presented  in  the  resident  graduate  management  programs  of 
the  Air  force  Institute  of  Technology.  Zt  is  also  intended  to  be  used 
by  the  practicing  logistics  manager  as  a  ready  reference  following  a 
formal  academic  program. 


The  increasing  cost  and  complexity  of  our  defense  force  combined  with 
frequent  reductions  in  resources  requires  each  logistics  manager  to 
become  knowledgeable  of  the  quantitative  problem-solving  techniques 
available.  This  text  is  designed  to  serve  ss  a  reedy  reference  of  those 
quantitative  tools  most  frequently  used. 


Chapters  1  through  5  discuss  concepts  and  techniques  used  in 
cific  functional  areas  of  logistics  mansgammnt.  Chapters  6,  7,  and  • 
provide  a  brief  review  of  basic  statistics.  The  remaining  14  chapters 
cover  specific  quantitative  tools  used  to  mid  the  logistics  manager  ia  the 
decision-making  process.  Ko  classified  or  copyrighted  materiel  ia  contained 
in  this  handbook.  - 


The  handbook  is  a  compilation  of  the  work  of  several  authors.  Bach  is 
Identified  la  the  Table  of  Contents.  My  sincere  thanks  to  thorn  for  their 
effort.  I  also  wish  to  thank  and  acknowledge  Kathy  Taylor*  Shirley  Sawyer* 
and  Peggy  Lluheoh  for  their  handling  of  all  the  typing  and  nueerous  other 
details  in  a  prompt  and  aceurata  manner. 


April  1980 


JOHN  S.  ENGEL,  Lt  Ool*  OSAP 
EDITOR 


i 


Accession  For 


HTIS  GRAfcl 
OTIC  TAB 
Unannounced 
Justification. 


t 


□ 


Distribution/ 


Dlst 


Availability  Codes 
(Avail  and/or 
Special 


h 


r.-v* 

i 


sulk  or  contents 


SUPPLY,  Lt  Ool  Wayne  Kirk,  Maj  Jammu  Jenson  (AFRas)  ,  and 
Janes  Jackson. 

CIVIL  ENGINEERING  APPLICATIONS,  Maj  Todd  Stewart. 
CONTRACTING  APPLICATIONS,  Col  Larry  Seith  and  Capt  Thanes 

MAINTENANCE,  Lt  Ool  Richard  Badala— nte. 

TRANSPORTATION,  Maj  Joal  Know lea. 

BASIC  STATISTICAL  COHOBPT8,  Maj  Laalia  Zaabo. 

PBOBABILITT  DISTRIBUTIONS,  Maj  Luli*  Zumbo. 

ESTIMATION  AMD  BYPOTHESIB  TEST  IMG,  Maj  Laalia  Zaabo. 

imdbz  MW— s,  Ool  Larry  Sadth. 

TUSK  SERIES  FORECASTING  TECHNIQUES,  Richard  Morphy. 

GOST  ESTIMATIOH,  Maj  Laalia  Zaabo. 

LIFE  CYCLE  GOST,  Maj  Laalia  Zaabo. 

RBGRSSS10M  ANALYSIS,  Jack  Bale. 

THE  LEARNING  CURVE,  Jack  Bala. 

ANALYSIS  OH  VARIANCE,  Maj  Leslie  Zaabo. 

LINEAR  PROGRAMMING,  Maj  Todd  Stewart. 

NETWORK  ANALYSIS,  Maj  Todd  Stewart. 

INVENTORY  MODELS,  Maj  Thonaa  Barrington. 

QUEUING  MODELS,  Maj  Thonaa  Barrington. 

RELIABILITY,  Maj  Leslie  Zaabo. 

ECONOMIC  CONCEPTS,  Richard  Taliaferro. 

A 

NOWPARAMBTlilC  STATISTICS,  Capt  Frederick  Lawrence  and 
Lt  Col  John  Engel. 


SUPPLY 


Om  the  jftirt  our  Air  Force  Major  Co— anda  and  base  supply  units  have 
da— loped  nunarous  management  indicators  to  assist  in  guiding  their  efforts 
toward  improved  nission  support.  Many  of  those  indicators  have  been  modi- 
fisd  due  to  changes  in  eemput—  programs  and  management  needs.  Frequently, 
of  flows  new  to  the  Air  Force  and  officers  from  the  rated  career  field  are 
assigned  to  base  level  supply  management  positions.  They  may  find  they 
have  a  lack  of  understanding  of  the  stqpply  "business"  and  need  information 
on  which  to  base  manages— t  decisions.  This  chapter  was  developed  to 
assist  the  officers  new  to  the  supply  field  to  understand  so—  selected 
supply  management  indicators.  With  this  understanding,  the  officers  should 
be  capable  of  —king  rational  decisions  earlier  in  their  supply  assign— t 
and  bee— a  a  more  effective  contributor  to  the  supply  support  mission. 

Our  inventory  —nag— it  actions  relate  to  —  the  trade-offs  in 

costs  for  holding  inventory,  costs  of  ordering  inventory  and  out  of  stock 
costs.  «■  most  effectively  iss—  its—  requested  fie—  stock  and  if  no 
stock  exists,  —  —at  have  a  timely  ordering  system  to  respond  to  the 
custom ar  needs.  For  the  most  part,  managers  at  all  levels  no  longer  expect 
the  supply  system  to  meat  every  need  with  e  100*  issue  rate  fie—  stoek. 

The—  managers  real!—  the  cost  of  holding  stock  is  of  such  magnitude  that 
it—  stookage  occurs  only  whoa  justified  by  pest  il—nila  or  when  justified 
by  special  projects,  pragmas  or  ml— loo  changes. 

Bass  level  etvply  is-  essentially  the  retail  level  in  our  distribution 
system,  it  is  a  node  in  a  larger  system  with  supply  sources  of  OCA,  Army 
depots,  navy  depots,  G8A,  and  Air  For—  depots  being  * fie  wholesale  level. 

The  objectives  of  the—  different  levels  of  supply  are  essentially  the  earn, 
narinon  customer  support  while  considering  the  ooet  tradeoffs  of  inventory 
management  — d  minimising  tbs  total  b— border  days  whan  items  not 

in  stock,  noohordors  will  ooour  bees u—  it  is  not  physically  and  economically 
possible  to  bold  in  whreho— a  inventory  nil  possible  east—  requirements. 

Managing  the  supply  system  requires  maaageman t  attention  in  two  ere—. 
Control,  1—  and  rsplsnlsh— t  of  — rsbou—  inventory  and  requisition 
control  and  iss—  of  rmstons r  —Is  for  its—  not  in  stock.  Tbs  use  of 
management  indicators  la  the—  two  ere—  la  extra— ly  helpful  la  deter¬ 
mining  « fie  degree  of  success  —  failure  of  the  distribution  system  in 
reap— as  to  cm— on—  demands.  The  management  indicators  should  be  derived 
fie—  data  soaro—  that  are  readily  avellahle,  pres— t  valid  data,  be  limited 
te  the—  indicate—  that  a—  —pessary,  and,  ,of  seer—,  be  understood  and 
easily  comm— looted.  9—  indicators  fieeqssatly  used  by  Major  Cos— ds  end 
has—  warn  seleotad  for  analysis.  Tbs  indicators  a—  —  follows  t 

1.  Custom—  Pupport  Effectiveness 

2.  ftockage  Effective— 


3.  Met  Mission  Capable  Supply 

4.  Priority  Requisition  Rato 

5.  Ranch  Stock  Support 

6.  Overall  Inventory  Accuracy 

7.  Delinquent  Doeunant  Rate 

8.  Unauthorised  Bquipment 

9.  Serviceable  Balance  -  Ho  Warehouse  Location 

10.  Repair  Cycle  Support  (Delayed  Repairs) 

Each  indicator  will  be  examined  in  the  following  pages  and  specific 
questions  will  be  answered.  Why  is  the  indicator  iaportant?  How  is  the 
indicator  determined?  what  management  actions  should  be  taken  to  improve 
an  unfavorable  indicator  tread? 

1.  CUSTOMER  SOPPOET  SS3SSSBBSL 

IMPORTANCE  OP  IHDICATORi  Customer  support  is  the  sole  reason  for  the 
ekistance  of  a  Base  Supply.  The  support,  however,  must  be  provided  within 
the  tradeoffs  previously  Mentioned  and  the  maeerous  constraints  imposed  by 
higher  level  supply  directives. 

This  indicator  shows  a  percentage  of  issued  node  in  relation  to  total 
requests  for  items  that  have  bean  assigned  stock  levels.  It  does  not  refer 
to  data  analysis  of  requisitions  for  items  that  did  not  have  a  stock  level. 

Past  ilneanrts  for  an  item  have  justified  the  need  for  e  stock  level  or  a 
special  level  may  have  been  justified  and  established,  m  either  case,  a  stock 
level  existed  and  a  balance  was  not  available  for  issue,  thus  asking  a  back¬ 
order  necessary.  This  situation  causes  two  requisitions  for  the  sene  line 
item.  One  requisition  for  the  crust  bust  and  the  requisition  that  already 
existed  for  stock  i^qilmitsT— nr  It  is  also  possible  that  the  requisition 

for  the  truer amsr  could  be  in  a  requisition  Priority  Group  1  or  2  and  there¬ 
fore  odd  to  the  numerous  priority  requisitions  within  our  uniform  Material 
Movement  and  Issue  Priority  System.  Customer  support  effectiveness  directly 
reflects  the  ability  of  supply  managers  to  manage  warehouse  inventories,  it 
is  obvious  that  lew  osstemmr  mspport  effectiveness  asy  degrade  en  organisations 
capability  to  support  its  mission. 


nays  sooRGSi  Customer  support  sffsetivsnsss  data  is  obtained  from  tbs  UWIVAC 
1050-n  B>)2  (Monthly  Bass  Supply  Management  Report) .  This  report  breaks  out 
each  type  organisation  CAPW  44-1,  civil  Magineers,  vehicle  Maintenance,  and 
others)  separately  as  well  as  provides  an  overall  percentage  of  customer 
iqpeet  effectiveness.  The  **24  (Organisation  iffeotiveness  Report)  breaks 
out  oaeh  orgoniaetlou  code  separately  end  ellowe  the  review  of  separeto  shop 


within 


FORMULA: 


Customer  Support 
Effectiveness 


mamagehemt  actions  that  cam  be  taken  to  REVERSE  AM  UNFAVORABLE  TREND: 

It  is  no  snail  undertaking  to  improve  an  unfavorable  trend  of  this  indicator. 
The  trend  reflects  a  sunnary  of  thousands  of  variables  within  the  supply 
system.  It  also  reflects  the  support  given  to  all  organisations  on  a  base, 
perhaps  hundreds. 

A  good  way  to  start  in  analysing  this  indicator  is  to  select  an  organi¬ 
sation  that  is  waking  a  significant  negative  contribution  and  then  review 
those  lteas  with  the  lowest  issue  rate.  The  M-24  report  can  be  used  to 
identify  Hie  organisation.  Identification  of  individual  stock  nunbered 
items  can  be  accomplished  by  asking  the  organisation  for  a  list  of  all 
System  Requirement  Designators  (SRD's)  they  use  when  calling  in  requests  to 
Demand  Processing.  Six  to  ten  SRD's  per  organisation  is  normally  the  maxi¬ 
mum  an  organisation  will  have.  Review  of  the  SRD's  with  the  Base  Supply 
File  Maintenance  Section  is  necessary  to  insure  that  the  applicable  SRD's 
are  loaded  into  Hie  computer  data  bank.  After  verification,  request  the 
Automatic  Data  Processing  Equipment  (ADPE)  Unit  to  process  the  SRD's 
through  the  R-37  report  (SRD  Demand  Data  Analysis)  using  the  current  D-13 
report  tape  (Daily  SRD  Update).  The  R-37  report  will  output  oards  for 
every  stock  i—hcr  requested  using  the  SRD's.  The  R-37  report  oards  can 
be  input  to  the  R-29  report  (Problem  Item  List)  to  produce  a  computer  print¬ 
out  for  manual  review  of  item  history.  A  thorough  examination  of  Hie  data 
history  (demands,  levels,  order  and  ship  tine)  on  each  problem  stock  num¬ 
bered  item  needs  to  be  accomplished.  As  problems  are  found,  positive  long 
range  solutions  need  to  be  lnplaamnted. 

As  you  can  see,  improving  customer  support  for  stocked  items  is  a  time 
consuming  task.  The  analysis  of  the  problem  nay  require  numerous  hours  of 
review  for  each  organisation.  Even  though  locating  the  problems  is  difficult, 
developing  long  range  solutions  and  applying  quality  control  management  to 
the  solutions  will  be  even  more  time  consuming  and  difficult.  The  proce¬ 
dures  described  are  extremely  effective.  At  one  Air  Force  Base  a  stockage 
effectiveness  rate  of  65%  was  Improved  to  87.79%  in  only  seven  months  by 
use  of  these  procedures. 

The  base  supply  manning  is  not  sufficiently  large  to  implement  the 
above  suggestions  for  managing  all  items  that  degrade  customer  support.  An 
over  aggressive  effort  to  improve  this  indicator  will  divert  subordinates 
attention  from  other  duties  and  negatively  affect  the  supply  account  in 
other  areas. 


Total  issues  from  authorised  stock 
Total  issues  from  items  authorised 
stock  +  due  outs  with  due  ins  of 
items  authorised  stock 


IMPORTANCE  or  IHDICAT<*I  N»st  of  our  baaes  have  a  flying  mission  and  com- 
nunications  equipment  that  require  supply  support.  Thaaa  weapon  systems  aro 
not  only  high  coat  but  ara  also  critical  to  tha  dafanaa  of  tha  Dai  tad  s tat as. 
Monitoring  an  indicator  that  follows  waapon  ay at am  support  is  halpful  in 
knowing  tha  succass  of  supply  to  support  thass  aaaantial  requirements .  in 
waapon  systsms  support,  tha  number  of  supply  aourcas  (depots)  is  1 ini tad. 

This  permits  our  supply  parsonnol  to  coneantrata  managamant  af  forts  on  a  faw 
dspots  and  obtain  axtraowly  halpful  coaputar  reports  advising  of  depot  support. 

DATA  SOUSCS t  Tha  M-ll  (Waapon  System  Support  Bffaetivanass  Report)  provides 
the  data  for  this  indicator. 

FORMOSA:  Pareantagas  ara  conputad  by  dividing  line  items  issued  plus  line 
items  backordered  minus  line  itsms  B/0  not  authorised  stock  into  line  items 


MAWWMT  ACTIONS  THAT  CAM  SS  TAKEN  TO  REVERSE  AN  UNFAVORABLE  TREND:  The 
stoefcage  effectiveness  indicator  follows  the  percentage  of  total  requests 
for  itasH  that  ware  filled  from  base  stocks.  Tha  indicator  reflects 
overall  weapon  system  support  and  permits  managsmant  to  isolate  sources  of 
supply  that  are  contributing  to  good  as  wall  as  peer  support.  Individual 
stock  numbered  items  causing  low  effectiveness  can  ha  isolated  and 
thoroughly  esant  nod.  Irani  nation  would  consist  of  analysis  of  order  and 
ship  tins, ,  followups,  raj  eat  rota#  use  of  past  number  loaded  in  the  oon- 
pater#  receipt  analysis#  demand  levels,  and  perhaps  analysis  of  scan  mis¬ 
sion  ntisngs  that  might  base  occurred  panning  a  variance  in  demand  patterns. 

In  analysing  ordsr  and.  ship  ties  it  is  Jape  ift  sal  to  knew  haw  often  tha 
hutanmtai  Data  Proesssiag  Rqulpswnfr  (ADPB)  Molt  is  rslsvsling.  Tha  M-32 
report  nay  shew  30  to  50  times  a  month  hat  this  ootid  bo  inflated  by  tha 
AfifS  omit  rslsvsling  5  to  10  times  on  slow  days.  Tbs  ADFB  unit  has  a 
requirement  to  asaplsta  ralavaling  each  eomputar  dabs*  Daily  aooospliah- 
susit  af  rslsvsling  is  vary  iRpamnt  since  tbs  urgency  of  need  *B"  and  "C" 
requisitions  (includes  shook  replenishment)  account  for  as  much  as  90*  of  all 
requisitions  placed.  Proper  rslsvsling  action*  lead:  to  bulldlap  stocks  and 
therefore  permits  a  higher  percentage  of  issues,  when  issues  are  mads, 
customers  will.  1mm  fane*  cancellation*.  -  Cancellations  ara  another  area 
rated  la  supply  asoagamimt.  • 


Aaqalslttgm  foil  aw  sf  is  another  iapertant  factor  in  stock  management, 
the  ASPS  omit  is  required  to  oosplota  requisition  follow-up  at  least  twice 
a  mnntfi  If  batk  follow-ups  am  oomplstsd  oa  ths  same  day  sad  early  in  tha 
msmth#  it  is  ahvioaa  that  valuable  valid  dtatue  will  net  be  available  to 
sngply  managers.  When  using  this  method,  many  requests  will  net  mast  the 
time  frame  for  follow-up  mod  consequently  they  will  be  overlooked  for  as 
lemg  as  30  diura*  The  stock  control  fait  will  he  forced  to  use  naauolly 
prepared  fsllow-Mp  dec— ate  which  require  mawroua  nanhours.  With  pcopar 
amsapansat#  tha  osupaesr  could  havu  acooaplishod  tha  iollow-«pe  with  no 


manhour  expenditure .  Follow-ups  Should  be  accomplished  the  first  and  third 
week  of  each  month*  If  time  is  available,  follow-ups  cou  d  be  completed 
daily  just  after  releveling  is  complete .  This  action  will  provide  the 
Stock  Control  Unit  with  the  most  current  status  each  day.  The  computer- 
program  will  not  allow  follow-up  on  the  same  item  every  day.  Follow-up 
will  be  accomplished  only  on  those  items  that  are  past  due  the  follow-up 
time  frame. 

Computer  inputs  that  add  to  the  "Reject  Sate"  may  also  negatively 
impact  on  Stockage  Effectiveness.  Rejects  that  are  not  promptly  corrected 
and  re- input  cause  delays  in  the  order  and  ship  time.  Therefore,  rejects 
must  be  cleared  in  a  timely  manner  to  avoid  degrading  stockage 
effectiveness • 

A  significant  improvement  in  use  of  the  UNXVAC  1050-11  capabilities  has 
been  the  addition  of  item  part  numbers  to  item  records.  Nearly  every 
National  Stock  Number  has  a  part  number  and  loading  the  part  number  in  the 
computer  greatly  reduces  requisition  processing  time.  With  the  part  number 
loaded,  maintenance  personnel  can  call  in  their  request  by  part  number. 

The  computer  converts  the  part  number  to  a  stock  number  and  causes  an  issue 
or  backorder.  The  key  to  this  process  is  to  correctly  load  the  part 

number,  and  Technical  Order  (T.O. )  figure  and  index.  After  the  loading  of 

an  items  data  is  complete ,  manual  research  for  requisitions  will  no  longer 

be  required.  All  bases  may  not  be  taking  full  advantage  of  the  computer's 

capability.  An  aggressive  program  to  load  part  number  data  will  yield 
imediate  and  long-term  customer  support  benefits.  Delays  in  requisition 
research  and  processing  will  be  greatly  reduced.  An  effective  way  to  build 
a  computer  data  base  is  to  ask  maintenance  personnel  to  provide  part  number 
and  stock  number  data  at  the  time  of  request  call-in.  Initial  building  of 
the  data  base  in  this  manner  is  slow  but  very  rewarding  in  the  long  run. 

Delays  in  receiving  items  for  stock  will,  of  course,  also  affect 
"Stockage  Effectiveness".  The  M-32  report.  Source  of  Supply  Summary, 
Receipts  nummary,  will  identify  problem  sources  of  supply.  This  report, 
long  with  the  QOS  Routing  Identifier  Listing,  will  further  identify 
supply  sources  that  need  to  be  reviewed.  The  S29,  Problem  Item  List,  can 
then  be  used  to  selectively  obtain  item  record  data  from  the  computer.  The 
M-32  and  QOS  reports  are  not  good  tools  to  work  with  if  the  ADPB  Obit  is 
not  processing  status  daily  or  the  Receiving  Section  is  behind  in  pro¬ 
cessing  receipts. 

In  reviewing  demand  levels  you  may  find  maximum  levels  established  by  a 
depot  Inventory  Manager.  This  occurs  because  the  quantity  of  a  line  item 
is  critically  short  in  the  supply  system.  Because  they  are  short,  bases 
cannot  be  permitted  to  hold  the  high  stock  levels  that  demands  may  justify. 

Aggressive  management  of  this  indicator  will  prove  helpful  in  main¬ 
taining  or  achieving  a  high  level  of  support.  The  importance  of  Weapon 
System  stockage  effectiveness  cannot  be  overstated.  Constant  review, 
follow-up,  and  coordination  of  requirements  are  essential  to  maintain  effec¬ 
tive  mission  support. 


3.  NOT  MISSION  CAPABLE  SUPPLY  (NMCS) 


IMPORTANCE  OF  INDICATORS:  It  is  very  important  to  know  the  down  time 
of  the  major  item  in  a  weapon  system  due  to  the  inability  of  the  supply 
system  to  respond*  When  items  are  not  immediately  available  for  issue  from 
warehouse  stocks  one  of  several  actions  would  take  place: 


a*  Review  R26  Due  In  From  Maintenance  (DIFM)  Listing  and  D19  Awaiting 
Parts  (AMP)  Validation  Listing  for  status  of  item  in  DIFM. 

b.  Withdraw  the  item  from  the  War  Reserve  Spares  Kit  if  available. 

c.  Maintenance  may  cannibalize  the  required  item  from  another  major 
item. 

d.  Requisition  the  required  item  from  the  appropriate  depot. 

e.  Locate  the  required  item  at  another  base  and  have  the  item  shipped. 
This  can  be  accomplished  after  receipt  of  delayed  status  on  the  requisi¬ 
tion.  Lateral  support  action  causes  double  shipping  costs,  since  in  most 
cases  the  base  that  shipped  the  item  must  now  also  requisition  their 
shortage . 


One  of  these  actions  would  eventually  resolve  the  item  shortage. 

Another  important  factor  relating  to  NMCS  is  the  affect  the  priority 
requisitions  have  on  the  Uniform  Materiel  Movement  and  Issue  Priority 
System  (UMMIPS).  If  it  becomes  necessary  to  acquire  the  item  from  lateral 
support  or  a  depot,  a  requisition  in  Priority  Group  1  will  be  created, 
this  high  priority  enjoys  "immediate"  communication  and  air  shipment  of  the 
item.  As  the  distribution  system  receives  a  higher  percentage  of  these 
priority  1  requests,  the  system  becomes  more  costly  to  operate  and  subject 
to  request  processing  delays  at  all  priority  levels.  NMCS  rates  are  affected 
by  new  demands  (items  not  authorized  stock),  the  age  of  the  weapons  system, 
depot  support  and  base  supply  management  of  items  authorized  stock. 

DATA  SOURCE:  Data  for  this  indicator  is  obtained  from  listings  maintained 
by  the  Materiel  Control  Unit  of  the  maintenance  activity  and  NMCS  data 
identified  in  D23  Mission  Capable  (MICAP)  Report. 


FORMULA:  NMCS  Rate  ■  Number  of  NMCS  Hours _ 

Number  of  Available  Weapon  System  Hours 


The  key  factor  is  "NMCS  hours"  rather  than  "NMCS  occurrences".  Once  the 
initial  NMCS  request  is  unsatisfied,  the  rate  is  dependent  upon  the  ability 
of  the  distribution  system  to  react  to  the  NMCS  requisition. 


MANAGEMENT  ACTIONS  THAT  CAN  BE  TAKEN  TO  REVERSE  AN  UNFAVORABLE  TRENDS  A 
NMCS  rate  will  exist  because  it  is  not  economical  to  have  stock  levels  for 
every  item  that  will  require  repair  or  replacement.  However/  improving  an 
unfavorable  trend  will  require  agressive  continuous  work.  In  date  raining 
the  reason  for  out  of  stock  situations/  there  is  frequently  a  requirement 
for  extensive  review  of  supply  data  and  constant  follow-up  by  supply  per¬ 
sonnel.  Each  requisition  for  an  itea  to  resolve  a  NMCS  requires  constant 
monitoring.  In  addition  to  monitoring  existing  NMCS  conditions/  supply 
personnel  oust  determine  if  the  NMCS  rate  is  so  high  that  a  general  review 
of  weapon  system  stock  levels/  inventory  procedures/  order  and  ship  time  or 
other  supply  management  areas  require  analysis  to  identify  «Jkortcosdngs  in 
the  system. 

Often  times  problems  can  be  found  in  computing  the  NMCS  rate. 

Therefore/  coordination  with  maintenance  may  be  necessary  to  be  assured 
that  the  "Number  of  Available  Weapon  System  Hours"  are  correct  and  that 
NMCS  start  and  stop  times  are  being  logged  in  correctly.  Maintenance  can 
also  be  of  considerable  assistance  in  reducing  the  NMCS  rate  by  performing 
cannibalization  action  when  appropriate . 


4.  PRIORITY  REQUISITION  RATE 


IMPORTANCE  OF  INDICATOR:  As  you  would  expect,  priority  requisitions 
within  our  logistics  distribution  system  receive  more  management  attention 
than  routine  requisitions.  This  is  because  of  the  procedures  established 
by  the  Uniform  Materiel  Movement  and  Issue  Priority  System  (UMMIPS). 

UMMIPS  establishes  criteria  whereby  requisitions  can  be  categorized  and 
processed  according  to  a  priority  that  is  relative  to  the  requesting 
activities'  military  mission  importance . 


DOD  governs  the  requisition,  issue,  and  movement  of  all  materiel 
managed  by  its  separate  service  components,  and  through  agreement,  those 
materiel  items  furnished  to  the  components  by  General  Service  Administration 
(GSA) .  DOD  has  also  directed  that  each  component  maintain  a  priority 
system  as  a  management  tool  for  the  proper  allocation  of  materiel  resources 
within  criteria  that  has  been  uniformly  established. 


DOD  has  prescribed  five  categorizations  and  the  criteria  for  assigning 
each.  A  categorization  is  called  a  Force/Activity  Designator  (FAD)  and 
identifies  all  U.  S.  and  certain  foreign  assistance  pact  countries  as  to 
their  respective  places  in  the  spectrum  of  military  essentiality. 


While  the  FAD  is  very  important  in  the  establishment  of  priorities  it 
is  but  one  of  two  factors;  the  other  being  the  "urgency  of  need”  designator 
(UND).  The  UND  is  a  means  by  which  a  requesting  organization  can  convey 
the  degree  of  urgency  of  its  materiel  needs.  When  the  requesting 
organization's  FAD  is  combined  with  the  UND  it  assigns  a  materiel  require¬ 
ment  a  "Priority  Designator"  (PD). 
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Priority  Designators  are  a  series  of  numeric  codes  from  01  to  15  that 
are  used  to  categorize  the  relative  importance  and  urgency  of  requests. 

When  the  requests  are  categorized  and  refined  through  a  base  supply 
requisitioning  program,  requisitions  are  prepared  to  the  higher  level 
supply  source.  It  should  be  noted  that  the  KELSTRIP/UMlIPS  criteria  apply 
only  to  those  requisitions  that  leave  the  base  to  the  Inventory  Manager  (IM) 
System  Manager  (SM) ,  DLA,  GSA  or  another  military  department. 

The  requisitions  are  segregated  into  three  priority  groupings  to  faci¬ 
litate  handling  and  processing.  Bach  group  has  established  guidelines  and 
time  limits  for  every  step  in  the  requisition  processing.  Priority 
groupings  govern  the  type  of  communications  media  and  priority  used  for 
requisition,  status  information,  follow-ups  and  replies t  the  speed  at  which 
the  supply  source  processes  the  requisitions;  the  materiel  selection  and 
issue  criteria;  and  the  selection  of  the  mode  of  transportation  used  to 
ship  the  materiel  to  the  requesting  agency. 

Selection  of  the  proper  UND  is  the  responsibility  of  the  Commander  of 
the  requesting  activity.  By  conscientiously  weighing  OND  alternatives, 
individuals  can  insure  that  demands  which  have  the  most  serious  impact  on 
mission  capability  will  receive  priority  processing  and  handling  through 
the  entire  requisition  cycle.  The  UMMIPS  is  designed  for  selective  use  of 
priorities  based  on  predetermined  factors.  Automatic  and  uncontrolled 
assignment  of  priorities  is  an  abuse  of  the  system,  and  as  such  cannot  be 
condoned. 

The  wrong  OND,  once  assigned  by  the  user,  becomes  the  prime  contributor 
to  the  abuse  of  the  system.  Whenever  the  user,  who  could  very  likely  be  a 
mechanic  on  the  line,  determines  how  detrimental  the  lack  of  an  item  is  to 
the  operational  support  of  a  system,  a  sequence  of  significant  events  is 
started. 

After  approval  to  requisition  an  item  and  the  priority  has  been  cer¬ 
tified,  the  requisition  leaves  the  base  by  high-priority  communication;  the 
depot  processes  the  request,  takes  the  needed  item  from  stock  and  packs  the 
order  for  shipment;  and  the  transportation  section  ships  the  materiel  by 
the  most  expeditious  method  consistent  with  the  priority  group.  Bach  of 
these  actions  is  extremely  costly,  but  considered  worth  the  expense  to  get 
the  needed  item  to  the  user  in  minimum  time.  If  the  priority  is  legiti¬ 
mate,  the  expense  can  be  justified;  if  not,  the  assignment  of  the  incorrect 
priority  results  in  senseless  expenditures,  even  if  the  mistake  is  made 
only  once. 

DATA  SOOBCEs  The  Priority  Requisition  Rate  Data  is  identified  in  the  M-32 
Report  for  the  month  being  reviewed.  The  Q05  (Routine  Identifier  Listing) 
adds  each  month  together  for  a  total  FY  1  Oct  -  30  Sep  review. 

FORMULA;  Overall  total  Priority  Group  1  and  2  requisitions  and  special 
requisitions  divided  by  overall  total  Priority  Group  1,  2,  and  3  requisi¬ 
tions  and  special  requisitions. 
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MANAGEMENT  ACTIONS  THAT  CAN  BE  TAKEN  TO  REVERSE  AN  UNFAVORABtt  TREND:  All 
organization  and  unit  coanandars  have  an  inherant  responsibility  to  exor¬ 
cise  the  degree  of  supply  Management#  surveillance#  and  discipline  to 
enforce  the  concepts  and  procedures  of  the  UMMXPS  system.  Without  a  high 
level  of  coosuuid  interest#  the  logistics  system  cannot  satisfactorily 
respond  to  the  unit's  legitimate  materiel  needs.  Some  of  the  steps  the 
commanders  can  take  to  establish  proper  supply  management  procedures  and  to 
enforce  the  discipline  are  to: 

(1)  efficiently  plan#  forecast#  and  submit  requests  for  the  materiel 
needed  with  as  much  lead  time  as  possible.  This  is  basic  to  allowing  the 
supply  system  to  plan  for  support  instead  of  reacting  to  an  emergency. 

(2)  establish  a  system  whereby  each  UNO  "A"  request  is  properly 
reviewed  and  that  the  required  delivery  dates  (RDD)  and  the  quantity  of 
materiel  required  reflect  realistic  and  valid  factors:  and  request  only  the 
quantity  of  materiel  needed. 

Although  the  responsibility  of  enforcing  supply  discipline  is  that  of 
the  unit  commander#  in  the  final  analysis  it  is  the  Chief  of  Supply  who 
must  see  the  commander's  directions  are  being  followed. 

One  way  the  Chief  of  Supply  can  help  to  achieve  correct  adherence  to 
UMMIPS  procedures  is  by  Informing  each  of  his  customers  of  the  restrictions 
and  operating  procedures  that  control  the  degree  of  supply  support  that  aan 
be  given  and  the  ways  in  which  the  customer  can  assist  to  Improve  the  system's 
reaction.  For  example,  once  the  customer  understands  the  prevailing  order 
and  shipping  time  the  more  likely  he  is  to  submit  requests  on  time. 

Further#  the  customer  should  be  advised  to  avoid#  or  at  least  minimise# 
excessively  high  priority  requisitioning.  The  highest  degree  of  cooperative 
materiel  planning  and  forecasting  by  the  Chief  of  Supply  and  the  customer 
is  basic  to  maintaining  a  well  balanced  stock  position.  In  the  requests 
the  customer  submits  to  supply,  realism  must  be  reflected  not  only  in  the 
quantities  but  also  In  the  urgency  of  need.  The  capability  of  the 
logistics  system  to  react  satisfactorily  to  all  valid  mission  requirements 
depends  largely  on  the  validity  of  the  priorities  assigned  by  the  customer. 
When  the  majority  of  the  requisitions  within  the  distribution  system  are 
priority#  then  essentially  all  requisitions  become  routine. 

5.  BENCH  STOCK  POE  OPTS 

IMPORTANCE  OF  INDICATOR:  The  importance  of  the  bench  stock  system  is 
often  not  understood  by  supply  people.  A  maintenance  weapon  system  bench 
stock  is  the  "life  blood”  of  the  base  level  repair  cycle.  Base  maintenance 
facilities  are  the  best  source  of  supply  for  major  end  items  of  a  weapon 
system.  You  can  expect  that  a  good  base  maintenance  operation  will  repair 
70%  to  90%  of  all  repair  cycle  items  and  return  then  serviceable  to  base 
supply  warehouse  stocks.  Base  supply  has  a  critical  input  to  this  repair 
cycle.  They  must  make  available  to  maintenance  those  small#  usually  low 
cost,  high  consumption  items  that  are  necessary  to  make  end  item  repairs. 

When  stocks  are  not  available#  the  repair  of  the  major  end  items  will  be 


Zt  is  also 


delayed.  The  itw  nay  be  asst  to  the  lyat  ter  repair  Bsaslsf  a 
and  tha  flow  of  as  additional  itso  is  the  distribution  uystaa. 
obvious  that  as  affieisst  rapair  cycle  at  bass  level  will  radaoa  tbs  MCS 
rata  baeansa  norv  aarvioaabla  aaaata  will  ba  available  ter  1  ana  if  i  ate  iasoa 
to  euetoawrs. 

0 ATA  soonest  Tha  "Bench  stock  Beanery"  papa  of  tho  IHS2  provides  tills  data. 


WOI»  Soneh  Stock  Ooo  Out  Sato  -  Total  Lias  Itaaa  Poa  Out  (over  U  days) 
(Da layed)  Total  Lisa  Ztosa  bathoriaad 

NWMBBR  ACTXOUB  OAT  CAN  M  WV  TO  Willi  Ml  OMAVOkABUt  TRSMDi  Tha 
whole  story  is#  of  coarse,  sot  told  by  this  Indicator.  Soaa  bonoh  stock 
lias  itaao  aro  aora  iaportaat  than  others.  Sosm  lisa  itaas  on  backorder 
aifht  ba  far  a  qeaatity  of  two  or  three  salts  or  a  thoosaad  salts  or  acre. 
Soaa  base  baa  ah  statin  directly  support  a  weapon  systea  where  as  other 
beach  statin  nay  asps  art  Graphics,  Civil  Baginssrs,  Transportation,  fhoto 
Lab,  etc*  ill  baaeh  stocks  arc  iaportaat  and  require  constant  aanapaaant 
asrvelllaaee  by  baas  supply. 


1  theroaph  saSerstaaflsp  of  how  tin  bench  stock  ayataa  is  supposed  to 
work  and  the  preeedaree  ter  sddinp  and  daletinp  bench  stock  iteas  is 
necessary  before  efforts  oaa  ha  takaa  to  laprove  an  anf avorable  "Bench 
Stock  One  Oat  late."  Baaeh  stock  itaaa  are  ordered  trn  a  variety  of  sour¬ 
ces,  i.e.,  local  parohaoa.  Ml,  DLL,  as  well  as  fron  allitary  depots.  When 
perfaraiap  analysis*  It  Is  asoeaaary  to  look  at  hundreds  of  variables.  A 
pood  place  to  start  in  raaolvlap  probleas  is  by  identifying  those  bench 
stocks  that  aro  saatribatlap  tin  neat  to  tin  worsoninp  trend,  a  review  of 
your  N*>M  Orpaaleetlca  Bffectiveaeas  Baport  outlines  line  itaaa  authorised 
and  year  supply  effectiveness  by  organisation  code.  A  personal  visit  to 
the  bunch  stocks  being  reviewed  and  discussions  with  tin  shop  aanapers  is 
neeasosry  and  often  aaliphtaalnp.  Likewise,  discussions  with  Base  Supply 
beach  stack  auaagars  tinuld  also  be  helpful  in  understanding  tha  probleas 
with  a  particular  beach  stock. 


it  efforts  osa  be  placed  on  specific  line  itaas  tint 
til  and  requisition  follow-up  an  each  line 
in  an  lapmvort  bench  stock  due  out  rate. 


XMMMMCi  0 W  nozcATPht  One  of  the  aoet  iaportaat  indicators  of  Chief  of 
•apply  psrfocaaaoo  is  Inventory  Accuracy.  The  serviceable  balance  field 
ea  the  aoapuisr  itaa  reoord  is  tin  basis  for  two  uoaputar  dacision-whathar 
or  not  to  issae  assets  to  satisfy  a  oust oner  request  and  when  to  create  a 
stock  rcplanlstnant  requisition.  Zf  tin  actual  warehouse  balances  differ 
free  itaa  reoord  balances,  these  decisions  will  be  incorrect.  Xn  one  ease, 
a  warehouse  refusal  will  be  generated  and  ouetoaer  confidence  in  tin  supply 
support  abilities  will  be  lapaired.  In  tin  other  ease,  tin  etoek  reple- 
nishauat  requisition  will  be  delayed  and  tin  probability  of  an  ensuing  aero 
balance  will  increase. 


Data  is  available  on  tbs  M-32,  Monthly  Bass  Supply  Managauant 


10— Hit  Mo  external  oouputation  for  this  indicator  is  necessary,  but 
under  progran  control  total  units  ovar/short  v  raoord  balance  -  %  that  is 
nines  fron  100%  ■  Inventory  Xc curacy. 

MMBIfi— t  XCTIOMS  TO  TAXI  TO  IMPROVE  AM  ONTAVORABUE  INDICATOR  TREND*  Poor 
inventory  accuracy  presents  a  serious  supply  situation.  Many  factors  iapact 
on  this  indicator,  i.a. ,  pilferage  fron  warehouse  bias  or  the  receiving 
line,  itens  la  the  wrong  warehouse  location,  quantity  oount  incorrect  by  the 
leeeivlag  Section,  incorrect  oount  by  the  Inventory  Section  during  Cycle 
or  Sanple  Inventory  and  inventory  counts  correct  but  oosputer  record 
edjustasnto  incorrect  are  seas  of  the  aajor  problen  areas  that  require 
review.  Saan tines  problem  with  inventory  accuracy  can  be  isolated 
to  a  single  stockroon.  This  nay  indicate  poor  stoekroen  warehouse  aanage- 
mat.  Also,  it  could  be  that  the  stockroon  has  gone  an  extended  period 
without  inventory.  In  som  eases,  a  wall  to  wall  stockroon  inventory  nay 
be  necessary.  Som  itaa  records  nay  exceed  365  days  without  inventory,  this 
needs  review  and  correction,  individuals  on  the  night  shift  nay  be  asking 
issues  without  issue  doounents  or  carelessly  pulling  stock  and  returning 
stock  into  an  incorrect  bin.  The  quality  of  night  work  can  be  checked  by 
requiring  posting  of  issues  to  a  log.  The  next  day,  inquiries  can  be  node 
for  stock  balance  and  warehouse  locations  physically  checked.  Warehouse 
security  my  also  require  review  to  be  assured  that  unauthorised  personnel 
do  not  have  access  to  the  warehouse. 

The  aonthly  consolidated  Inventory  Adjustmnt  Docunant  Bagister  M10 
categorises  the  type  of  diserupaacies  identified  so  that  corrective  action 
can  be  Initiated.  If  the  nature  of  a  discrepancy  indicates  the  need  for 
reporting,  it  will  be  expeditiously  furnished  to  the  OBI  Detachmnt,  with 
lnfoxaetlon  copy  to  Chief,  Security  Policy,  to  detemine  if  there  is  probable 
cause  to  stupect  theft  and  if  an  investigation  is  required  XAM  Alt  125-21. 

The  inventory  section  is  responsible  for  obtaining  supporting  docunantation 
and  attaching  this  docunantation  to  the  M10.  Oonpletad  causative  research 
wort  sheets  or  inventory  registers  containing  a  statamnt  by  the  Inventory 
section  that  adequate  research  has  been  parfomed  and  evidence  of  theft, 
fraud  or  nisdsmanor  (does)  (does  not)  exist. 

7. 

IMPORTANCE  OP  INDICATOR i  The  Delinquent  Docunant  Rata  is  a  necessary  aanage- 
mnt  tool  and  a  good  indication  of  the  overall  condition  of  a  stock  record 
account.  Nmerous  supply  transactions  ocour  each  day  that  require  feed¬ 
back  so  mnageaant  can  be  assured  that  transactions  were  ooupleted.  A 
high  Delinquent  Documnt  Rate  normlly  indicates  a  poor  supply  account  with 
unaggreasive  wane tenant.  Selected  transactions  result  in  the  D04  Daily 
Poomant  Register  production  of  a  do  omen  t  oontrol  card  (DOC) .  Delinquent 
Source  Doounents  (D— »  for  Manually  prepared  do  omenta  sueh  as  Post  post 
transactions  nay  experience  delays  in  oosputer  processing.  Tbs  doounents 
are  collected  throughout  each  day  and  given  to  the  Poomant 


Control  Section 


the  mm  day*  Thay  ara  Mtchad  to  the  output  DOC  from  the  D04  the  following 
day*  All  docunanta  do  not  become  delinquent  in  the  same  length  of  time  and 
all  docunanta  do  not  go  into  permanent  file  for  an  audit  trail* 

The  Document  Control  Section  mat  ha  aaaurad  that  each  signed  document 
compares  with  a  DCC  card  before  the  document  is  filed  or  destroyed*  (The 
docunanta  narked  for  ultimate  destruction  are  temporarily  filed  for  15 
days,  since  the  Base  Service  Store  (BBS),  Tool  Issue  Center  (TIC) , 

Individual  equipment  Unit  (IEU)  went  to  line  item  accounting  system)  • 

DATA  sound!  Document  control  cards  are  produced  by  the  computer  when  cer¬ 
tain  tranMctions  occur.  Cards  can  be  used  to  make  a  listing  for  easier 
reading  and  use  by  individuals  throughout  base  supply. 

FORMULA:  Total  documents  overdue  return  to  the  Document  Control  Section 
divided  by  the  total  number  of  Document  Control  Cards. 

MANAGEMENT  ACTIONS  THAT  CAN  BE  TAKEN  TO  REVERSE  AN  UNFAVORABLE  TREND: 
Aggressive  management  throughout  base  supply  is  necessary  to  hold  down  the 
number  of  delinquent  documents*  The  proper  flow  of  documents  must  be  known 
by  the  individuals  handling  documents  and  they  must  give  adequate  emphasia 
to  the  return  of  signed  documents  to  the  Document  Control  Section. 

Documents  may  be  missing  for  a  variety  of  reasons,  i.e. ,  property  awaiting 
signature,  property  not  pulled  from  stock,  property  not  delivered,  property 
issued  and  the  receipt  document  lost,  property  lost,  property  stolen  or 
perhaps  missing  documents  may  be  accumulating  in  an  office  and  consequently 
not  delivered  to  the  Document  Control  Section*  A  thorough  review  of  all 
possible  causes  for  the  missing  document  must  ba  accomplished  before  a  cer¬ 
tificate  of  lost  document  is  completed* 

8.  UNAUTHORISED  EQUIPMENT  (ALLOWANCE  SOURCE  CODE  000  LOWGER  THAN  15  DATS) 

IMPORTANCE  OP  INDICATOR!  Air  Force  equipment  items  are  high  cost  and/or 
pilfer able  type  items  that  require  accurate  control  and  accountability* 

The  maintenance  of  accountable  records  and  retaining  and  maintaining  equip¬ 
ment  is  a  costly  process*  When  equipment  items  are  held  by  custodians 
without  proper  authorisation  an  activity  elsewhere  nay  be  deprived  of  the 
item  while  the  unit  holding  the  item  is  causing  unnecessary  inventory 
accounting.  In  addition,  unauthorised  items  may  be  taking  shop  or  office 
space  that  is  needed  for  another  purpoM*  The  impact  of  an  equipment 
excess  within  a  single  organisation  may  not  be  large  but  from  an  entire 
base  or  Major  Command  standpoint  the  amount  of  unauthorised  equipment  might 
be  very  significant  and  require  aggressive  management  attention  throughout 
the  command* 

DATA  BOURCEi  Major  command  programs  will  normally  be  established  to  pro¬ 
vide  this  data.  Also,  the  Q09  Allowance  Source  Cods  Listing  for  ABC  000 
provides  this  information  for  base  review  prior  to  being  reported  to  the 
data  bank  by  your  Did  Daily  Equipment  TranMOtion  Report  or  NOE  Authorised 
In  Use  Report. 
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FORMULA:  this  —  mg— nt  indicator  is  usually  based  on  the  total  dollar 
value  of  equipment  held  without  authorisation  for  a  period  longer  than  15 
days*  Equipment  say  also  be  determined  by  a  percentage* 

%  of  Equipment  excess  -  ♦  of  units  ABC  000  over  15  days 

TOTAL  units  authorised 


MANAGEMENT  ACTIONS  THAT  CAM  BE  TAKEN  TO  REVERSE  AM  UNFAVORABLE  TREND:  In 
reviewing  this  indicator  it  will  be  helpful  to  identify  the  individual 
iteas  of  concern  and  the  custodian  accounts  on  which  they  were  listed* 

In  many  cases  it  is  helpful  to  learn  the  cause  for  the  excess*  There  say 
have  been  an  error  in  a  revised  Table  of  Allowance  that  incorrectly  caused 
the  excess*  The  itaa  could  have  been  directed  into  a  base  by  a  higher 
headquarters  without  establishing  appropriate  Allowance  Source  Codes  (ASC) . 
Moveawnt  of  a  weapon  system  from  a  base  may  also  create  excesses*  To  ali- 
minate  the  ASC  000. and  hold  the  item,  an  AF  Form  60 1A  with  justificetion 
must  be  processed  to  higher  headquarters.  This  will  permit  assignment  of 
ASC  000A  to  the  computer  record  indicating  that  a  request  for  authorisation 
has  been  submitted.  If  retention  of  the  item  is  not  approved,  then  the 
excess  item  east  be  cleared  by  turn-in.  Occasionally,  iteas  will  remain  on 
ASC  000  for  an  extended  period  because  they  are  extremely  large  or  fastened 
to  the  earth  or  a  facility.  These  kinds  of  items  may  require  e  contract 
with  a  civilian  business  to  disconnect  or  physically  move  the  item. 
Extensive  turn-in  delays  may  occur  causing  the  item  to  remain  as  ASC  000 
beyond  15  days*  Equipment  managers  should  aggressively  resolve  all  ABC  000 
beyond  15  days*  Problems  should  be  elevated  to  appropriate  oowmders  whan 
they  cannot  be  resolved  at  lower  management  levels. 


9*  SERVICEABLE  BALAMCE  -  MO  WAREHOUSE  LOCATION 

ZMMH9ANCB  or  INDICATOR:  Items  with  a  serviceable  balance  and  no  ware¬ 
house  location  are  lost.  Having  a  serviceable  balance  without  a  warehouse 
location  oan  occur  in  a  number  of  different  ways*  Perhaps  a  receipt  would 
be  processed  without  the  item  being  physically  received*  Another  possibi¬ 
lity  is  thst  a  substitute  item  nay  be  received  and  placed  In  the  warehouse 
location  of  the  preferred  item.  A  receipt  could  be  processed  and  the  item 
is  in  the  warehouse  awaiting  location  assignment.  A  receipt  could  be  pro¬ 
cessed  and  the  item  pilfered  from  the  receiving  line. 

The  seriousness  of  this  situation  is,  of  course,  that  the  items  are  in 
stock  for  custoBMMf  support.  If  a  demand  for  an  item  occurs  and  it  cannot 
be  located,  a  warehouse  refusal  results*  If  the  customer  still  requires 
the  item,  a  requisition  through  the  distribution  system  will  be  necessery 
and  delay  customer  support.  Items  ordered  for  War  Reserve  Materiel  (NAM) 
stock  but  not  assigned  a  location  may  degrade  the  readiness  of  the  MM  kit 
This  indicator  is  very  important  because  it  informs  supply  managers  about 
problems  with  the  flew  of  items  without  a  warehouse  location  through  the 
receiving  and  warehouse  sections*  Closely  following  this  indicator  and 
taking  prompt  corrective  action  will  be  helpful  in  heading  off  serious 
problems  and  in  maintaining  an  excellent  supply  account. 


DATA  SOURCE;  M-32  itea  Record  Data 


FORMULA:  Serviceable  Balanca 

Mo  warehouse  location  rata 


Supplies  total  itsa  rooord 


MAMAQMEMT  ACTZ0M8  THAT  CAM  BX  TAKEN  TO  REVERSE  AM  UNFAVORABLE  TREND: 
Reversing  this  unfavorable  trend  requires  quick,  aggressive,  and  positive 
steps*  The  pccblesa  any  be  located  in  the  Receiving  Section*  Perhaps 
there  has  been  an  unusually  heavy  work  load,  a  new  individual  assigned  to 
the  section  or  the  Receiving  Section  supervisor  has  been  absent  for  a 
period  of  tins.  The  type  of  iteas  alasing  aight  be  a  due  that  can  relate 
the  pcoblea  to  a  particular  warehouse  stoekroca  supervisor*  Another  possi¬ 
bility  oould  be  an  excessive  anount  of  ooaputer  downtiae  causing  "Poet  Post 
docuasut  processing*  Zf  an  extensive  Mount  of  incorrect  "Poet  Post"  pro¬ 
cessing  has  occurred,  aany  asnageaent  indicators  will  reflect  an  unfa¬ 
vorable  trend.  In  any  ease,  early  correction  of  this  unfavorable  tread 
aust  be  aecoaplished. 


IMPORTANCE  OP  INDICATOR:  Air  Force  Repair  Cycle 
repairable  type  iteas  required  in  support  of  a  a 


high  cost 


are  axtroaoly  Halted  in  our  suppl 
and  tiaely  repair*  Xtaas  not  rape 
Mission  Capable  Supply  (MNCS)  situ 
we  discussed  earlier,  oausee  an  ad 
systea  and,  of  oourse,  ceases  down 
reflects  the  coordinated  effort  of 
Supply  personnel  east  assure  adsqu 
accurate  location  records*  Nsiata 


ur  supply  systea  sad  require  close  accountability 
not  repaired  in  a  tiaely  aannsr  nay  pause  a  Mot 
CS)  situation  to  develop.  A  MCS  condition,  as 
ss  an  additional  burden  to  *— *  distribution 
sea  dowstlas  for  a  weapon  systea*  This  indicator 
ffert  of  supply  and  aalwtananoa  personnel, 
re  adequate  banchatock  for  repair  and  aaintain 
Maintenance  aust  sake  tiaely  repairs  to  iteas 


and  return  than  serviceable  to 
the  aisslon  success  of  the  weaj 


This  coordination  is  vital  to 


DATA  SOURCE:  M-32  Repair  Cycle  Asset  Control  Data  (all  organisations) 


FORMULA: 


Repair  Cycle 
Support  Rate 


Total  units  repairable  this 
station  over  4  days  APM  64-1 


Til  J  1 


-  Total  Obits  repairable  this 
Station,  ASM  66-1  organisations 
•  total  other  organisations* 


daily  supply-naintsaanee 
repair  cycle  systea  that 
Chief  of  Maintenance  is  < 


RON  TO  RltlRSa  Ml  tEtPAVORASLE  TRENDS  A 
I  is  the  place  to  report  any  probleas  in  the 
be  resolved  at  lower  working  levels.  The 
y  interested  in  the  production  of  his 


—InttBMO  shops  am d  the  support  by  base  supply.  Ha  realizes  that 
excessive  delays  in  rapairing  rapair  cycla  items  may  oausa  an  ineraasa  work 
load  doe  to  cannibalization .  Tha  flying  missile  or  pn— run 1 cations  mission 
could  also  bo  dsgradsd  das  to  nonavailability  of  serviceable  items.  Supply 
managers  need  to  examine  their  document  control  procedures  to  be  assured 
that  adequate  controls  exist. 


stnuutf 

tha  foregoing  is  in  no  way  a  full  treatment  of  all  the  management  indi¬ 
cators  available  or  used  in  Base  Level  Supply  Management.  The  tan  indica¬ 
tors  selected  for  review  are  very  important  to  management  but  there  are 
other  indicators  of  equal  importance •  The  analysis  presented  should  be 
helpful  to  individuals  new  to  the  supply  "business"  in  understanding  the 
supply  system  and  becoming  an  effective  contributor  to  supply  management 
earlier  in  their  supply  assignawnt. 


This  chapter  presents  a  nuaber  of  exaaples  dasignsd  to  illustrate  tho 
application  of  several  fundamental  management  seionea  techniques  to  typical 
bass  civil  engineering  problsas.  Thraa  general  typaa  of  techniques  ara 
included*  (1)  deterministic  mathematical  soda ling  (linaar  prograaaing) , 

(2)  network  analysis,  and  (3)  engineering  eoonoaic  analysis.  The  deter- 
alnistic  aathaaatical  progressing  exaaples  include  linaar  prograaaing, 
transportation,  and  assignaant  probleas .  The  network  analysis  exaaples 
illustrate  closed-circuit ,  shortest  route,  aariaal  flow,  and  FKKT/CFM 
probleas.  The  aoonoaic  analysis  exaaples  illustrate  investment  decision 
probleas.  The  solution  algorithms  applied  in  these  exaaples  are  discussed 
in  other  chapters. 


b*  Dtoiiiop  Variable  (Wj)  i  Weight  in  kips  that  can  ba  carried  by 
hoiet  j  <*-1,2, and  3). 

c.  8avironeantal  Variablea : 

8±i  Load  in  kipa  carried  by  airport  i  (i-1,2,...,5). 

2.  ftgai  8elationahipa 

a  -  wx  ♦  w2  +  w3 

8lf  m2  *  1481’  2/1m2  “  ®1  i  ®* 

S2I  10W2  -  14s2>  5/7W2  -  S2  <  8k  ' 

f3>  5W3  +  10S2  -  12S3  <  16k 

5W3  4  10(5/7W2)  -  1283  <  16k 
5/12W3 4  25/42W2  -  83  <  16k  ) 

I 

84i  282  4  7W3  -  1284  <  16k 

2(5/7W2)  +  7W3  -  1284  $  16k 
„  S/42W2  4  7/12Wj  -  S4  <  16k 

* 

8ss  7W3  +  16*2  4  21W1  -  288s  <  20k 
•  • 

~  *>%%  ♦  4  3/4%  <  20k 

Sgt  7Wj  4  12W2  +  21W3  *  28%  <  20k 
W»L  4  3/7*2  4  3/4%  -  S|  <  20k 


<20k 


L 


LP  Model 

Maximize : 

z  -  w1  +  w2  +  w3 

S.T. 

2/7W2 

<  8 

5/7W2 

<  8 

23/42W2  +  5/12W3 

<  16 

5/42W2  +  7/12W3 

<  16 

3/4Wx  +  4/7W2  +  1/4W3 

<  20 

I/4W1  +  3/7W2  +  3/4M3 

<  20 

*1#  w2,  w3. 

>  0 

Solution 

Algorithm 

Table  2-1 

w2  w3 


-i  I  -i  -1 


S2 

S3 

S4 

S5 

S6 

bi 

2/7 


5/7 


25/42  5/12 


5/42  7/12 


4/7  1/4 


1/4  3/7  3/4 


1  20 


The  Initial  solution  is  not  optimal.  Thera  is  a  tie  for  the  entering 
variable.  Arbitrarily  select  W^.  As  enters,  S5  leaves.  The  new  solu¬ 
tion  iss 


Solution  Is  optimal .  However,  there  are  alternative  optimal  solutions, 
i.e.,  w2  can  be  brought  into  the  basis  without  changing  Z.  If  this  Is 
done,  the  solution  is: 


-  64/6  Z  -  40  -  r 

W2  “  56/5 

W3  -  16 

EXAMPLE  2-2 


A  contractor  will  submit  a  unit-price  bid  on  the  foundation  work  for  a 
large  concrete  structure  to  be  constructed  in  support  of  development  work 
on  the  MX  missile  system.  After  reviewing  the  plans  and  specifications  and 
his  estimate  of  the  competition  on  this  project,  he  sets  a  total  bid  limit 
at  $750,000  (7.5  x  105).  The  government's  estimate  and  the  contractor's 
estimate  of  the  quantities  of  work  involved  are  shown  in  the  table  below. 
The  last  column  of  the  table  is  the  contractor's  estimate  of  the  acceptable 
range  within  which  he  can  submit  unit  prices.  Given  the  estimated 
construction  schedule  in  the  table  and  an  interest  rate  of  10%,  determine 
unit  prices  for  the  three  required  construction  items  which  will  maximize 
the  present  worth  of  his  income. 


ITEM 

1  gw 

.  SST 

I  CONSTR.  SCHEDULE 

UNIT  PRICE 

YR  1 

YR  2 

YR  3 

RANGE 

Earth 

Excavation 

4x10s 

3.5x10s 

2.5x10  s 

$0.80-1 .80/CY 

Rock 

Excavation 

3x10  4 

3. 5x10 4 

O.SxlO4 

$6.00-1 1 .00/CY 

Drilling 

1.8x10 3 

1.8x10  3 

1 .3x10s 

$6. 00- 12. 00/ft 

Solution  Procedure 
A.  System  Analysis 

1.  System  Component 

a.  Criterion  Variable  (Z):  Present  value  in  dollars  of  total 
income;  the  abjective  is  to  maximize  Z. 

b.  Decision  Variables  (Xj):  Unit  price  in  dollars  for  task  i 
(i-1,2,3). 

c.  Environmental  Variables: 


(1) 

ULj_ 

m 

upper  limit  on 

unit  price  for  i 

(2) 

LLj. 

m 

lower  limit  on 

unit  price  for  i 

(3) 

B. 

m 

max.  allowable 

bid  price 

(4) 

I 

m 

interest  rate 

(5) 

m 

amount  of  item 

i  scheduled  for  year  j  (i-1,2,3;  j-1,2,3) 

't—  lUlxtionxhli 
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Hgur> 


:iJ  “  [  (1.10A  (2*5  *  10  )  +  d.io)^  (1*°  x  1C|5)]  xx 

-  [(0.91 >(2. 5x10s)  ♦  (0.83)(1.0x105)]  Xx 

-  (2.275x10s  +  0.83x10s )XX 

-  3.105x10SXx 

31  -  [ - (0.5xl04)  +  - O.OxlO4)]  X. 

hl.10)1  (1.10) 2  J  2 

-  1(0.91)  (O.SxlO4  )  ♦  (0.83)(3.0x104)jX2 

Lj)  -  (0.455x104  ♦  2.49x104)X2 

-  2.945x10*.  %2 
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*1<*3> 


*  2<*1> 
*2<x2> 
*2<*3> 


L (1.10)2  ""  ”  '  (1.10)3 

-  1(0.83) (0.5x10 3 )  +  (0.75 )( 1.3x103)Jx3 

-  (0.415x103  ♦  0.975x103)X3 

-  1.390x10 3 *X3 

-  4.0x105X1 

-  3. 0x10  ^2 

-  1.8x10 


(O.SxlO3)  + 


(1.3x10  )|  X, 


LP  Modal 

Maximize «  Z-3 . 105x10  5X, +2 . 945x10  4X+1 . 39x10  3  «X  • 

1  2  3 


S.T.  4.0x10SXI+3. 0x10  ^j+I.  8x10^  <7. 5x105 

Xx  >  0.80 
Xx  <  1.80 
X2  >  6.00 
X2  <  11.00 
X3  >  6.00 
X3  <  12.00 

Solution  Algorithm 

Max. s  S-310.500XX  +  29,450X2  +  1,390X3  +  08*  +  0S2  -  MA2  * 
0S3  ♦  0*4  -  MA4  ♦  08 5  ♦  0>6  -  MA6  +  0S7 

S.T.  400,000X1  4  30, 00 OX 2  +  1,800X3  +  Sx  -  750,000 

Ij  •  I2  Hj  •  0.80 

Xx  ♦  83  -  1.80 


X 


«.  ♦  A.  -  6.00 


*2  *3  *1  «!  *,  ') 


!5 I  E6  *6 


>29,450  -1,390 


400,000  30,000  1,800 


BBEBBEBBEDHI 


Modifying  to  obtain  an  initial  faaaibla  solution: 


Table  2-6 


Solution  ia  not  optimal.  enters,  Ag  leaves. 
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M  CO  >  W 


It  la  not  feasible  to  deliver  any  material  fro*  storage  aita  2  to  service 
sona  3.  How  should  the  daaaad  at  aach  of  tho  four  service  zones  ba 
aatlaflad  to  minimise  tho  total  delivery  time? 

Solution  Procedure 

h.  ixsSgmto&B&L 

1.  System  Components 

a*  Crltarloo  Var labia  (S) t  Total  truckload-miles ;  tha  objective 
la  to  minimize  z. 

b.  Paclalon  Variable  (X^j)s  Truckloads  of  matarlal  1  to  service 
sons  j,  where  (i"1,2,3i  j*1,2,3,4). 

c.  Environmental  Variables t 

8  i  ■  truckloads  of  material  available  at  storage  site  i  ( i- 1 , 2 
Dj  »  truckloads  of  material  required  at  service  zone  j  ( j**1 ,2, 
S  -  total  subtly  at  all  sites 
0  *  total  requirement  at  all  zones* 


NlaialM  2  -  1.2lxi  ♦  3.4X12  +  2. 2X13  +  2.5X14  ♦  3.8X21  ♦  4.5X22 

♦  2.3X24  *  4*3X31  ♦  6.4X32  +  3.5X33  +  4.2X34 

».*.  xn  ♦  x12  ♦  x13  ♦  x14  <  200 

*21 +*22  ♦  *24  3  3*0 

*31  **32*  *33  +*34i150 

*  11  +  *21  +  *  31  *  125 

X12  ♦  X22  ♦  X32  -  150 

*13  ♦  *33  *  «* 

*  14  *  *  24  +  *  34  “  150 

*ij>  0  (1-1, 2,3;  j-1,2,3,4) 


This  LP  aodal  can  ba  solved  with  tha 
Bowrrar,  this  problem  can  bo  solved  some* 
transportation  (simplex)  algorithms 


el  simplex  algorithm, 
ere  efficiently  using  the 


PjMHj  2-4 


A  particular  subsystem  la  extensively  used  in  many  of  tha  mechanical 
systems  on  baaa.  Thara  ara  1,000  of  thaaa  aubayataaa  in  usa ,  each  of  which 
has  two  components.  Bach  subsystem  assembly  ia  configured  aa  followa: 


Figure  2-4 


Component  1  hae  a  reliability  of  0.925  and  component  2  haa  a  reliability  of 
0.90.  Replacement  components  are  available  from  two  sources.  They  can  be 
purchased  from  a  vendor  in  the  year  in  which  a  failure  occurs* 
Alternatively,  half  of  the  components  that  fail  in  one  year  can  be  repaired 
and  reconditioned  for  use  the  next  year.  Purchase  and  repair  costs  for 
each  of  these  components  are  shown  below: 


Table  2-16 


COMPONENT 

1 

2 

PURCHASE 

25 

50 

REPAIR 

20 

35 

Considering  only  the  next  two  years,  what  is  the  present  value  of  money  to 
be  programed  for  component  purchase  and  component  repair  (asstime  an 
interest  rate  of  10%). 

Solution  Procedure 

A.  System  Analysis 

1.  Components 

a.  Criterion  Variable  (Z):  Present  value  of  the  total  replacement 
cost*  objective  to  minimize  Z. 

b.  Decision  Variable  (X^j):  Number  of  units  of  type  i  for  year  j 


where: 


xij  >  0 

Solution  Algorithm 


,4»  j-1,2) 


The  problem  can  be  modeled  as  a  transportation  system  with  the 
following  sources  and  demand  points: 

Source:  1  -  purchased  units  of  component  1 
(i)  2  =  purchased  units  of  component  2 

3  *  repaired  units  of  component  1 

4  ”  repaired  units  of  component  2 


Demand: 
( j) 


1  “  component  1  required  in  year  1 

2  -  component  2  required  in  year  1 

3  ”  component  1  required  in  year  2 

4  -  component  2  required  in  year  2 


Table  2-17 


3 


|_M_ 


150  100 


I 


Initial  solution  by  least  cost  method  is  degenerate;  arbitrarily  place  e 
in  (1,2). 


Checking  for  optimality: 


(1,4): 

M 

(2,1): 

M 

(2,3): 

M 

(3.1): 

M 

(3,2): 

M 

(3,4): 

M 

(4,1): 

M 

(4,2): 

M 

(4,3): 

M 

41.3  +  45. 5  -  e  -  M+4.2 

22.7  +  M  -  45. 5  -  2M-68.2 
45.5  +  M  -  20.7  -  2M-66.2 

22.7  +  20.7  -  16.5  -  M-18.5 

H  +  20.7  -  16.5  -  4.2 

41.3  ♦  45.5  -  M  +  20.7  -  16.5 

22.7  +  M  -  45.5  +  41.3  -  28.9 

45.5  +  41.3  -  28.9  -  M-33.1 

20.7  +  M  -  45.5  +  41.3  -  28.9 


8.4 

2M-55.8 

M-53.8 


Solution  is  optimal. 


Z  -  150(22.7)  +  75(20.7)  +  100(45.5)  +  50(41.3)  +  75(16.5)  +  50(28.9) 

-  3405  +  1552.5  +  4550  +  2065  +  1237.5  +  1445 

-  14,255 


EXAMPLE  2-5 


Four  jobs  are  to  be  completed  by  four  structural  maintenance  and  repair 
crews.  The  superintendent  and  foreman  have  estimated  the  time  required  for 
each  crew  to  complete  each  job: 


Table  2-18 


H 

TASX 

1 

2 

3 

4 

CREW 

1 

10 

25 

16 

11 

2 

7 

26 

13 

21 

3 

35 

19 

18 

16 

m 

19 

26 

24 

10 

How  should  the  crew-task  assignments  be  made  to  minimize  the  total  time 
required  to  complete  these  tasks? 

Solution  Procedure 

A.  System  Analysis 

1.  System  Components 

a.  Criterion  Variable  (Z):  Total  time  to  complete  all  task; 


r-T1 


objective  la  to  minimise  Z. 

b.  Decision  Variable  (% j  )  t  X,  ,  -  1  if  crew  i  is  assigned  to  task 

j  (i,j  “  1,2,.. .,4);  0  otherwise.  3 

c.  Xnvironmental  Variable i 

Sj.  »  no.  of  tasks  assigned  to  crew  i  (i-1,2,...,4)  Si-1 

Oj  -  no.  of  crews  assigned  to  task  j  ( j«1,2,...,4)  Dj*l 

S  •  total  no.  of  crews  (4) 

D  -  total  no.  of  tasks  (4) 


2.  System  Relationships 


T 


Solution  is  not  optimal.  Tha  naw  matrix  is: 


Tabla  2*22 


Xll*  X23'  X32»  X44  "  1 
Z  -  10  +  13  +  19+10  -  52 


KXAWPLZ  2*6 

riva  diffarant  typas  of  dawataring  pump*  v*  availabla  for  visa  at  fiva 
diffarant  oon st ruction  sit as.  Tha  aff icianey  of  aaoh  pump  in  producing 
■axt—  yiald  at  aaoh  wall  is  shown  in  this  tabla  i 


60  I  40  I  6 


30  60  5 


70 

40 

6 

I 

30 

5 

1 

60 

40 

70 

20  I  50  I  3 


Determine  the 
for  ell  site*. 


type-site  ess lgnmsnt  that  will  maximize  total  efficiency 


(I) i  Total  pump  efficiency;  the  objective 


Is  to  maximise  Z. 

b.  Decision  Variables  (X*j)i  Xjj  -  It  if  pump  type  1  is  assigned 

to  site  j  (i-j-1,2,...,5)  *0  otherwise. 

c.  Xnvlrotsssntal  Variables 

Si  ■  number  of  sites  to  which  pump  type  1  can  be  assigned  (Sj-1) 
Dj  -  number  of  pump  types  which  can  be  assigned  to  site  j  (Dj»1) 
S  »  total  number  of  pump  types 
D  -  total  number  of  sites 


'  A-i-'-i  mrak 


Is 


An  optimal  solution  is  possible < 


Table  2-25 

rf  r 


5 


1 

2  1  3  I  4  |  S  1 

1 

20 

40 

0 

30 

20 

2 

o 

20 

0 

30 

0 

3 

30 

© 

10 

0 

30 

4 

20 

0 

40 

10 

© 

5 

60 

30 

50 

© 

20 

(1,3) 

70 

(2,1) 

60 

(3,2) 

60 

(4,5) 

80 

(5,4) 

80 

350 

EXAMPLE  2-7 


You  are  trying  to  determine  the  shortest  route  in  terms  of  travel  time 
from  the  main  fire  station  on  base  to  the  base  hospital.  You  have  the 
following  partial  map  of  the  base  on  which  the  travel  times  between  Inter- 


Solution  tTOCjtog 


1.  Lnbol  0:  0 

Cross  out  (1,0), (2,0) 

2.  (0,1)i  0  ♦  2  *  2  y 
Label  It  2 

Cross  out*  (3,1) , <4, 1) ,(6,1) 

3.  (0,2)  t  0  ♦  3  -  3/ 

(1.4) i  2+2-4 

Label  2 1  3 

Cross  out!  (4, 2), (5,2) 

4.  (1,4):  2  +  2-4/ 

(2.4) :  3+3-6 

Label  4:  4 

Cross  out:  (2,4),  (5,4),  (6,4) 

5.  (1,3):  2  +  3-5/ 

(2.5) :  3+4-7 

(4.5) :  4+2-6 

Label:  3:  5 

Cross  out:  (6,3) 

6.  (1,6):  2+5-7 

(2.5) :  3+4-7 

(3.6) :  5+2-7 

(4,5):  4  +  2-6/ 

Label  5:  6 

Cross  out:  (2, 5), (7, 5) 
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( 1,6) : 

(3.6) i 

(4.6) : 

(5.7) : 


2  +  5-7 

5  +  2-7 

4  +  2-6  / 

6  +  2-8 


Label  6:  6 

Croaa  out:  (1,6), (3, 6), (8, 6) 

8.  (5,7):  6  +  2-8/ 

(6.8) :  6+4-10 

Label  7:  8 

Cross  out:  (8,7) 

9.  (6,8):  6+4-10 

(7.8) :  8+1-9/ 

Label  8:  9 

Cross  out:  (6,8) 

The  optimal  solution  is:  8-7-5-4-1-0  or:  0-1-4-5-7-8  at  9  minutes . 


EXAMPLE  2-8 


A  facility  being  planned  for  construction  at  your  base  will  be  included 
in  Energy  Monitoring  and  Control  System  (EMCS) .  As  part  of  the  system, 
telephone  lines  must  be  run  from  the  existing  central  monitoring  and 
control  station  to  the  new  facility.  The  following  network  represents  the 
existing  system  of  conduits  and  junction  boxes  through  which  lines  can  be 
pulled.  What  is  the  shortest  route  connecting  the  new  facility  to  the 
central  station? 

Ficnira  2-8  _  _  . 

Control 


Station 


Facility 


1.  (0,3)  -  0  +  1000  -  1000 V 

Label  3  0  1000 
Cross  out:  (1,3), (2, 3) 


2.  (0,1)  -  0  +  2000  -  2000 V 

(3,4)  -  1000  4-  1000  -  2000 y 
Label  both  1  and  4  9  2000 
Cross  out:  (0,4) ,( 1,4) , (2,4) 


3.  (0,2)  -  0  +  2000  -  2000 1 

(1,2)  -  2000  +  1000  -  3000 

(4,6)  -  2000  4-  1000  -  3000 

Label  2  9  2000 

Cross  out:  (1,2) 


4.  (1,5)  -  2000  +  1500  -  3500 

(2.5)  -  2000  +  1120  -  3120 

(4.6)  »  2000  4-  1000  -  3000)/ 

Label  6  9  3000 

Cross  out:  (2, 6), (5, 6) 


5.  (1,5)  -  2000  4-  1500  -  3500 

(2.5)  -  2000  4-  1120  -  31207 

(4.5)  -  2000  4-  1500  »  35 00 

(6,7)  -  3000  4-  1500  -  4500 

Label  5  9  3120 

Cross  out:  (1,5), (4, 5} 


6.  (2,7)  -  2000  4-  2120  -  412o/ 

(4.7)  -  2000  4-  2120  -  412oV 

(5.7)  »  3120  4-  1000  -  41201^ 

(6.7)  «  3000  4-  1500  -  4500 
Label  7  «  4120 

Cross  out:  (6,7) 


Alternative  shortest  routes: 
0  -  2  -  5  -  7 
0  -  3  -  4  -  7 
0-2-7 
Z  -  4120  ft. 
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The  Base  Civil  Engineer  is  planning  the  development  of  a  recreation 
area  surrounding  a  lake  located  on  the  base.  The  planners/site  developers 
have  identified  ideal  locations  in  the  park  for  a  lodge ,  picnic  grounds,  a 
boat  dock,  camping  areas,  recreation  fields,  and  other  facilities.  These 
locations  are  represented  by  nodes  on  the  network  illustrated  below.  The 
branches  of  the  network  represent  possible  road  alternatives  in  the 
recreation  area.  If  the  park  designers  want  to  minimize  the  total  road 
miles  that  must  be  constructed  in  the  park  and  still  permit  access  to  all 
facilities,  which  road  alternatives  should  be  constructed: 

Figure  2-9 


Solution  Procedure 


This  is  a  minimal  spanning  tree  problem: 


Table  2-29 


V 

V 

V 

1 

2 

3 

V 

V 

5 

6 

(3,5)9 


3 


Arbitrarily  start  at  node  ( 1 ) 
Label  ()/)  node  (1) 

Connect  1-3 
Label  node  (3) 

Cross  out  (3,1) 

(1,2)  4 

(3.4)  2 i 
Connect  3-4 
Cross  out  (4,3) 

(1,2)  4V 

(3.5)  9 

(4.5)  7 
Connect  1-2 
Cross  out  (2,1) 

(2.5)  4V- 

(3.5)  9 

(4.5)  7 
Connect  2-5 
Cross  out  (5,2) 

(2.6)  5 

(3.5)  9 

(4.5)  7 

(5.7)  3V 
Connect  5-7 
Cross  out  (7,5) 

(2.6)  5 

(3.5)  9 

(4.5)  7 

(5.8)  5 

(7.8)  2T 
Connect  7-8 
Cross  out  (8,7) 

(2.6)  5 

(3.5)  9 

(4.5)  7 

(5.8)  5 

(7.6)  3V 

(8,5)  5 
Connect  7-6 
Cross  out  (6,7) 


Total  spanning  tree  length 


EXAMPLE  2-10 


One  configuration  of  the  MX  missile  deployment  system  being  studied  is 
illustrated  by  this  network  diagram: 

Figure  2-10 


Each  Launch  Control  Facility  (LCF)  directs  the  movement  of  a  mobile 
launcher  through  a  tunnel  system  linking  5  Launch  Facilities  (LF).  Find 
the  minimal  spanning  tree  representing  tunnel  links  connecting  the  LCF  and 
LFs. 


2-35 


6.  (0,5)  6 

( 1 #5)  8 

(2.5)  11 

(3.5)  10 
(4/5)  4 i 
Connect  4-5 
Label  5 

Cross  outs  (0,5),  (1,5),  (2,5),  (3,5),  (5,0),  (5,4),  (5,0),  (5,4), 
(5,1),  (5,3),  (5,2) 

Minimal  Spanning  Trees 

0-1  2 

0-2  3 

0-3  4 

3- 4  5 

4- 5  4 

18 


EXAMPLE  2-11 

An  independent  tenting  lab  has  been  retained  by  the  base  civil  engineer 
to  accomplish  materials  testing  for  a  major  on-base  construction  project* 
Five  separate  tests  are  to  be  accomplished.  These  tests  can  be 
accomplished  in  any  sequence.  However,  the  time  required  to  eet  up  the 
test  equipment  depends  on  the  sequence  in  which  the  tests  are  conducted. 
Equipment  set-up  times  (from  test  1  to  test  j)  are  as  follows s 


5 


12 


3 


1 


In  what  sequence  should  the  SMART  crew  be  routed  through  the  six  work  zones 
to  au.nuu.ze  the  total  relocation  and  set-up  time? 


Solution  Procedure 


This  problem  can  be  formulated  as  a  closed  circuit  problem: 


Table  2-43 


1 

2 

3 

4 

5 

6 

1 

D 

5 

15 

40 

25 

5 

0 

2 

20 

M 

15 

15 

n 

5 

3 

20 

10 

a 

■ 

10 

25 

n 

45 

5 

■ 

D 

■ 

25 

5 

35 

U 

10 

5 

■ 

0 

6 

m 

5 

15 

15 

5 

M 

5  10 


! 

i 


I  140 

i 


figure  2-17 


20 

15 

20 

15 

15 

15 

7l5 


Table  2-44 


2 

3 

4 

5 

6 

1 

■ 

10 

35 

20 

M 

2 

M 

15 

15 

5 

0 

5 

3 

10 

n 

m 

10 

25 

B 

5 

■ 

a 

0 

0 

15 

0 

5 

5 


10 


5 


EXAMPLE  2-13 

The  following  capacitated  network  represents  a  portion  of  tha  baa*  road 
ay a tan.  The  capacities  on  the  network  represent  the  maximum  average 
vehicle  traffic/nlnute  aa  determined  In  a  recent  traffic  survey. 


Table  2-48 


Solution  Procsdurs 


This  can  bs  aodslad 


5  lfr  15 


Q 


PSi  -  t4  -  ps2 

20 

PS1  *  *5  ‘  M2 

20 

psi  -  t2  -  t7  -  ps2 

10 

psi  '  -  t4  -  t6  -  ps2 

10 

PS1  *  *3  *  *5  *  T8  -  M2 

10 

PSi  -  t5  -  T?  -  ps2 

10 

120 

EXAMPLE  2-15 

The  Base  Civil  Engineer  has  given  you  the  opportunity  to  develop  and 
manage  the  accomplishment  of  a  small  project  requested  by  the  base  com¬ 
mander.  Although  not  complex,  the  project  is  time-critical  and  must  be 
closely  controlled.  After  analysing  the  project  system  and  discussing  it 
with  various  individuals  Mho  tall  be  involved  in  the  project,  you  have  com¬ 
piled  the  following  information: 

Table  2-50 

ACTIVITY  (X  -  j) 


-  1 
1  - 
1  - 


-  3 
3-4 
3-5 


a  “  optimistic  time  m  -  most  likely  time 


11 

6 

4 

14 

4 

3.5 

5.5 

9.5 
3 
3 


b  -  pessimistic  time 


2-52 


1.  Construct  the  project  system  network.  Assume  event  0  is  the  orlgxn  and 
event  5  xs  the  termxnal  node. 


2.  Compute  the  expected  txme  (t#)  and  variance  for  each  activity. 


3.  Find  the  critical  path. 


4.  Construct  a  graph  of  the  probability  of  completion  (Y  -  axis)  versus 
the  completion  time  in  days  (X  -  axis).  The  domain  should  vary  between  14 
days  and  20  days. 


5.  The  Base  Civil  Engineer  wants  to  give  the  base  commander  an  aetimated 
completion  date  in  which  the  confidence  level  is  at  least  98.0%.  How  many 
days  should  be  scheduled  for  completing  this  project  at  the  desired  con¬ 
fidence  level? 


Solution  Procedure 


(a  +  4m  +  b) 
6 


Cb-a)‘ 

36 


Table  2-51 


-  5 

-  3 
3-4 
3-5 


3 

7 

11 

7 

2 

2.5 

6 

3 

2 

3 

4 

3 

6 

7 

14 

8 

2 

3 

4 

3 

2.5 

3 

3.5 

3 

2.5 

4 

5.5 

4 

4.5 

5.5 

9.5 

6 

1 

2 

3 

2 

1 

2 

3 

2 

*<T5^  20)  -  0.9772 

(21-17)  . 


P(TS^  21): 


-  2.67 


(.4962) 


»<T|£21}  - 


0.9962 


You  Mod  to  schedule  at  least  21  days  for  a  confidence  level  ^98.0%. 
Alternatively: 


-  2.055  (2.055  -  98.0%) 


X  -  20.08  days 


EXAMPLE  2-16 

As  the  engineer  on  a 
following  data: 


11  construction  project,  you  have  compiled  the 


Table  2-52 


a"  represents  the  optimistic  completion  time  estimate  when  the  task  is 
"crashed"  or  expedited  at  the  associated  oost.  Determine  the  greatest  pro¬ 
bability  completing  the  project  in  18.0  days  (or  less)  if  $500  is  available 
for  expediting  the  project. 
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This  path  remains  critical  when  all  activities  are  crashed.  Consequently, 
it  is  only  necessary  to  crash  activities  0-1,  1-4,  and  4-6,  at  a  cost  of 
120  +  180  +  60  -  360  to  achieve  the  fully-crashed  te*  of  16.9  and  asso¬ 
ciated  variance  of  2. 13.  If  an  amount  less  than  360  was  available  for 
crashing,  the  activities  should  be  expedited  in  order  of  Increasing  crash 
ratio : 


Pr(te  ^  18) 


Pr 


0.5000  +  0.2734 


18-16.9 

•^2-13 


Activity 

Cost 
te  -  te# 

0  -  1 

400 

4-6 

600 

1  -  4 

900 

1.10 

1.10 

-  +0.75 

m  —  as 

1.46 

1.46 

0.7734 

If  the  project  is  not  crashed,  this  probability  is: 


18.0-17.5 


Pr 


0.5 
1.065 

0.5000  +  0.1808 


0.47 

0.6808 


Consequently,  the  probability  of  completing  the  project  In  18.0  days  or 
less  can  be  increased  from  0.68  to  0.77  by  spending  $360  to  crash  critical 
activities  0-1,  1-4,  and  4-6. 

EXAMPLE  2-17 


A  Base  Civil  Engineer  has  been  tasked  to  make  an  economic  analysis  to 
determine  if  the  siding  on  a  building  should  continue  to  be  painted  or  if 
vinyl  siding  should  be  applied.  The  following  data  is  available: 

a*  Painting:  O&M  costs  for  the  first  year  are  $500;  these  costs 
increase  at  $60  per  year  for  20  years.  Major  repair  during  5th  year  of 
$2,000,  during  the  10th  year  of  $3,000,  and  during  the  15th  year  of  $4,000. 

b.  Vinyl  Siding:  Initial  cost  of  $15,000.  No  O&M  Costs. 

Which  alternative  is  more  economical,  assuming  a  rate  of  return  of  10%? 

Solution  Procedure 


a.  Using  the  equivalent  annual  cost  method:  let  AC  *■  equivalent 
annual  cost  per  year,  A/G  “  the  factor  to  convert  a  gradient  series  to  an 
equivalent  uniform  annual  series  for  n  periods  (looked-up  in  appropriate 
table),  P/P  *  present  worth  factor  at  rate  1  for  period  n  (from  table), 

A/P  ■  uniform  series  capital  recovery  factor  at  rate  i  for  n  periods  (from 
table) 
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$500.00 


( 1 )  Painting: 


$60  (A/6,  i-10«,  n-20)  -  $60  (6.51) 

m 

390.00 

$2000 

$2000 

(P/F,  i-10%,  n-5)  •  (A/P,  i-10%,  n-20) 
(0.6209)  •  (0.11746) 

m 

145.86 

$3000 

$3000 

(P/F,  10%,  n-10)  •  (A/P,  i-10%,  n-20) 
(0.3855)  *  (0.11746) 

m 

135.84 

$4000 

$4000 

(P/F,  10%,  n-15)  •  (A/P,  1-10%,  n-20) 
(0.2394)  *  (0.11746) 

112.48 

m 

$1284.78 

ACX 
AC  2  - 
AC  3  « 

m 

AC4  - 

m 

AC  5  - 

TOTAL 


(2)  Vinyl  Siding: 

AC  -  $15,000  (A/P,  1-10%,  n-20) 

-  $15,000  (0.11746) 

b.  Using  the  present  value  method: 

( 1 )  Painting: 

$1284.78  (P/A,  1-10%,  n-20) 

-  1284.78  (8.514) 

(2) 

$1761.90  (P/A,  i-10%,  n-20) 

-  1761.90  (8.514) 


-  $1761.90 


$10,938.62 


-  $15,000.82 


EXAMPLE  2-18 


A  Base  Civil  Engineer,  prompted  by  the  Environmental  Protection  Agency 
and  others,  must  construct  a  new  sewage  disposal  facility.  The  base  is 
projected  to  grow,  and  he  knows  future  expansion  of  the  facility  is  inevi¬ 
table.  The  question  arises  whether  to  design  and  build  the  facility  to 
accommodate  added  capacity  at  a  later  date.  To  do  this  would  add  $20,000 
to  the  present  cost. 

Future  expansion  will  cost  $100,000.  However,  if  the  accommodation  is 
not  made  during  the  current  construction,  future  expansion  will  cost 
$140,000.  Assume  the  life  of  the  treatment  plant  to  be  50  years. 
Maintenance  costs  will  not  be  affected  with  or  without  the  accommodation. 

How  soon  oust  the  expansion  be  required  in  order  to  justify  the  addi¬ 
tional  $20,000  expenditure  now?  Assume  1-10%. 
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Solution  Procedure 


$20,000  spent  now  would  save  $40,000  n  years  front  now. 

(P/F,  i-10%,  n)  -  |-|2jl222.  -  0.5000 
F  40,000 

From  tables:  n  *  7.2  years. 

Alternatively : 

20,000  -  40,000  •  |  fiTioT"] 

( i. io)n  -  4£Lt222.  -  2.0 

20,000 

log  (1.10)  -  0.09531 

log  (2.0)  «  0.69315 

n  (0.09531)  -  0.69315 
n  -  0.69315/0.09531  -  7.27  years. 


Therefore 
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CHAPTER  3 


CONTRACTING  APPLICATIONS 

3-1  USING  PRICE  INDEX  NUMBERS  IN  DEFENSE  CONTRACT  PRICING* 


The  contracting  community  has  need  to  employ  many  different  tools  and 
techniques  in  estimating  the  prices  of  future  purchases.  The  high  probabi¬ 
lity  that  costs  and  prices  of  goods  and  services  will  change  over  time 
requires  development  of  approaches  to  estimate  that  change.  One  set  of 
approaches  uses  index  numbers  to  forecast  the  change  in  price  of  goods  and 
services. 

The  price  analyst  uses  index  numbers  for  three  general  purposes:  (1) 
to  deflate  or  inflate  prices  for  comparison  analysis,  (2)  to  project  price 
or  cost  escalation  in  contractual  documents,  and  (3)  to  inflate  or  deflate 
costs  to  facilitate  trend  analysis.  Index  numbers  are  used  in  price  analy¬ 
sis  to  compare  the  proposed  cost  of  an  item  with  the  cost  of  the  same  or 
similar  item  procured  in  past  years.  Here,  the  index  numbers  are  used  to 
discount  inflation  that  has  occurred  over  time  so  the  comparison  can  be 
made  in  constant  year  dollars.  Escalation  clauses  usually  call  for  some 
kind  of  after-the-fact  pricing  action  adjusting  the  price  paid  to  reflect 
actual  price  levels  at  the  time  of  contract  performance .  These  clauses  use 
index  numbers  to  measure  the  change  in  price  levels  over  time.  Index  num¬ 
bers  are  also  used  to  facilitate  trend  or  time-series  analysis  of  indivi¬ 
dual  cost  elements  by  eliminating  or  reducing  the  effects  of  inflation. 

The  analysis  can  then  be  performed  in  constant  dollars.  Each  of  these 
three  uses  of  index  numbers  in  contract  pricing  is  discussed  along  with 
some  examples. 

Price  Comparison  Analysis 

One  of  the  uses  of  price  index  numbers  is  to  measure  price  inflation. 
One  can  define  price  inflation  as  the  time  related  increase  in  price  of  an 
item  or  service  of  constant  quality  and  quantity.  Price  index  numbers  can 
be  used  to  compare  the  prices  of  the  same  or  similar  items  purchased  in 
different  time  periods  by  inflating  the  old  purchase  price  to  a  current 
time  period  or  deflating  a  current  price  to  some  old  purchase  time  period. 

Consider  the  problem  of  analyzing  a  contractor  proposal  of  $85,500  for 
a  turret  lathe  to  be  delivered  in  1976.  A  procurement  history  file  reveals 
that  the  same  machine  tool  was  purchased  in  1972  at  a  price  of  $48,500. 

The  task  is  to  determine  if  the  proposed  price  is  fair  and  reasonable. 


*Smith,  Larry  L.,  The  Use  of  Index  Numbers  in  Defense  Contract  Pricing, 
Technical  Report,  AU-AFIT-SL-1-76,  November  1976. 


The  approach  is  to:  select  or  construct  an  appropriate  index  series#  fore¬ 
cast  the  series  to  the  anticipated  date  of  production,  inflate  the  old  price 
to  current  dollars  and  compare.  The  Machinery  and  Equipment  Subindex  of  the 
Wholesale  Price  Industrial  Commodities  Index  (BLS)  is  selected  as  a  reaso¬ 
nable  indicator  of  price  movement  for  the  item.  The  data  of  Table  3-1  are 
extracted  from  the  1976  issue  of  the  "Economic  Report  of  the  President". 


Table  3-1 

MACHINERY  &  EQUIPMENT 


Year 

Index 

Year 

Index 

1967  (base) 

100.0 

1973 

121.7 

1971 

115.5 

1974 

139.4 

1972 

117.9 

1975 

161 .4 

The  data  need  to  be  forecasted  to  cover  the  anticipated  period  of 
production;  1976.  This  can  be  done  by  first  graphing  the  data  on  rec¬ 
tangular  coordinates,  hand  fitting  a  curve  to  the  data,  and  projecting  the 
curve  into  the  future.  See  Figure  3-1. 

For  short  range  forecasts  (2  years  or  less)  the  most  recent  data 
usually  give  the  best  indication;  therefore,  the  last  two  data  points  are 
used  for  a  straight  line  projection  to  183.0.  A  line  drawn  from  the  third 
from  last  point  through  the  last  point  gives  a  check  of  the  forecast  at 
181.0.  Thus  an  index  number  for  1976  between  181.0  and  183.0  appears 
reasonable.  Third,  inflate  the  1972  actual  price  to  1976  dollars  and  com¬ 
pare  the  result  with  the  proposed  price  of  $85,500.  This  is  done  by  first 
deflating  the  1972  price  to  1967  dollars  ($48,500  -5-  1.179  -  $41,137)  and 
then  inflating  this  result  to  1976  dollars  ($41,137  X  1.820  ■  $74,869). 

This  $74,869  is  in  1976  dollars  and  gives  one  indication  that  the  proposed 
price  of  $85,500  is  excessive. 

A  few  comments  are  in  order  here  to  highlight  possible  sources  of 
error.  First,  the  approach  assumes  constant  quality  and  quantity.  That 
is,  the  procuring  agency  is  buying  essentially  the  same  item  in  essentially 
the  same  quantities  as  purchased  in  1972.  Second,  it  is  assumed  that  the 
general  Machinery  and  Equipment  Index  is  representative  of  a  specific 
company's  turret  lathe.  In  fact,  the  Machinery  and  Equipment  Wholesale 
Price  Index  is  made  up  of  samples  of  production  of  many  different  kinds  of 
machines  and  equipment  produced  by  different  contractors  located  all  over 
the  country.  Third,  it  is  assumed  that  the  past  will  forecast  the  future 
and  that  a  line  drawn  through  data  points  will  predict  future  inflation. 
This  might  be  wrong  as  few  analysts  can  forecast  an  economic  turning  point. 

Nevertheless,  the  index  number  approach  to  price  analysis  gives  the 
analyst  another  tool  for  comparison.  It  can  be  used  to  check  prices  pre¬ 
dicted  by  other  methods  of  analysis  such  as  parametric  or  detailed  analysis 
approaches.  The  index  number  approach  can  also  be  used  as  a  basis  for 
price  negotiation  when  the  buyer  is  lacking  substantive  price  data. 
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Price  Escalation  Clauses 


There  is  a  need  for  some  Government  contracts  to  contain  a  price  read¬ 
justment  arrangement  providing  for  significant  unanticipated  fluctuations 
in  the  economy.  This  need  is  most  apparent  in  those  contracts  that  call 
for  performance  a  long  time  in  the  future  (more  than  two  years),  although 
sometimes  the  need  exists  for  shorter  term  adjustment.  Contracts  that  fall 
in  the  longer  range  category  are  typically  large  systems  production 
contracts  or  multiyear  production  contracts. 

The  possibility  of  unanticipated  fluctuation  in  the  economy  is  one  of 
the  elements  of  cost  risk  in  any  contract.  As  the  period  of  performance 
becomes  further  removed  from  the  time  the  contract  is  written,  the  risk 
becomes  greater.  Contractors  normally  include  some  contingency  dollars  in 
a  cost  proposal  to  compensate  for  this  risk  of  cost  overrun.  As  the  risk 
becomes  greater  with  longer  periods  of  contractual  coverage,  the  price  of 
the  contingency  becomes  unacceptably  high  from  the  Government's  point  of 
view.  This  risk  can  be  shifted  in  part  or  in  total  to  the  Government 
through  the  use  of  a  price  adjustment  clause. 

One  can  identify  two  general  approaches  to  constructing  price  adjust¬ 
ment  clauses  for  economic  fluctuations  in  large  dollar,  extended  production 
efforts.  One  is  to  construct  a  clause  to  compensate  for  any  and  all 
changes  in  the  economy.  The  other  is  to  construct  a  clause  to  compensate 
only  for  abnormal  fluctuations  in  the  economy.  Both  approaches  use  some 
index  number  series  as  a  basis  for  making  a  price  redetermination  at  the 
conclusion  of  the  effort. 

The  "any  and  all"  fluctuations  approach  is  simpler  and  easier  to 
understand.  If  the  index  number  forecast  for  some  specified  future  period 
of  performance  is  higher  or  lower  than  actual,  then  the  contract  costs  ori¬ 
ginally  predicted  using  the  forecast  index  number  would  be  adjusted  to 
reflect  actual  index  numbers  extant  at  the  time  of  performance.  For  a  spe¬ 
cific  element  of  cost,  the  "any  and  all”  fluctuation  clause  might  include  a 
repricing  formula  as  follows: 


Price  Adjustment  -  Actual  Index-Forecast  Index  Target 

Forecast  Index  Cost 


The  probability  of  correctly  predicting  the  level  of  the  economy  and  its 
associated  index  number  is  low,  thus  making  the  need  for  some  repricing 
highly  probable.  If  the  index  numbers  chosen  accurately  relate  a  particlar 
product  to  the  economy,  this  approach  eliminates  all  of  the  contractor's 
risk  associated  with  fluctuations  in  the  economy. 

The  "abnormal"  fluctuations  approach  assumes  that  a  normal  range  of 
economic  change  can  be  defined  and  that  prices  will  be  adjusted  only  if  the 
economic  indicators  fall  outside  of  that  normal  range.  Figure  3-2  portrays 
the  concept. 
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Figure  3-2 

Fluctuation  Around  an  Index  Number  Trend 


Normal 
Range  of 
Fluctuation 


Number 


Time  (Years) 


This  approach  requires  two  formulations  of  the  adjustment  equation  for  each 
eleme  t  of  cost  to  be  repriced*  One  formulation  adjusts  the  costs  upward 
if  the  actual  index  exceeds  the  high  side  of  the  range  and  the  other  for¬ 
mulation  adjusts  the  costs  downward  if  actual  index  falls  below  the  low 
side  of  the  range.  For  example: 


Adjustment 


Adjustment  ■ 


Actual  Index-High  Forecast  Index 
High  Forecast  Index 


Actual  Index-Low  Forecast  Index 
Low  Forecast  Index 


Target 

Cost 


Target 

Cost 


Selection  of  the  range  itself  is  a  rather  arbitrary  process,  but  it  can  be 
generalized  that  the  wider  the  range,  the  greater  the  amount  of  risk 
shifted  to  the  contractor*  A  reasonable  range  might  be  plus  or  minus  one 
standard  deviation  of  the  data  from  which  the  trend  line  was  originally 
defined.  Figure  3-2  is  a  graphical  portrayal  of  the  concept*  In  addition  to 
selecting  the  general  approach  to  construction  of  the  clause,  "all  or  any" 
versus  "abnormal,"  the  contracting  parties  must  agree  on  some  other 
variables  in  selecting  methods  for  price  adjustment  due  to  economic 
changes.  Two  of  these  other  decisions  are  whether  to  apply  the  adjustment 
to  forecast  costs  or  actual  costs  and  whether  to  use  the  forecast  index  or 
the  actual  index  as  the  denominator  in  the  adjusting  fraction. 

The  adjustment  might  be  a  function  of  either  target  costs  or  actual 
costs,  e.g.: 


Adjustment 


Actual  Index-Forecast  Index 
Forecast  Index 


Forecast 

Cost 


Adjustment 


Actual  Index-Forecast  Index 
Forecast  Index 


Actual 

Cost 


Opponents  of  actual  costs  as  a  basis  for  adjustment  advance  the  thought 
that  if  overruns  or  underruns  occur,  the  amounts  of  the  overruns  or 
underruns  would  unfairly  influence  the  adjustment.  Another  argument 
against  the  use  of  actual  costs  is  the  difficulty  in  defining  them  since 
allowability  of  costs  is  often  subject  to  negotiation.  A  third  argument 
against  the  use  of  actual  costs  as  the  basis  for  adjustment  is  time  delay. 
Often  actual  costs  are  not  audited  for  years  after  contract  completion. 

Advocates  of  actual  costs  as  a  basis  for  adjustment  believe  the  pur¬ 
pose  of  the  adjustment  clause  is  to  adjust  the  price  for  causes  beyond  the 
contractor's  control.  Therefore,  the  adjustment  should  be  made  on  the  basis 
of  actual  costs  whatever  they  might  be. 

Another  decision  facing  the  constructor  of  escalation  adjustment 
clauses  is  the  need  to  decide  on  which  factor  to  use  in  the  denominator  of 
the  adjustment  fraction.  Precedent  has  established  that  the  denominator  be 
the  actual  index  when  actual  costs  are  adjusted  and  that  the  forecast  index 
be  the  denominator  when  forecast  costs  are  to  be  modified,  e.g.: 


Adjustment 


Actual  Index-Forecast  Index 
Forecast  Index 


Forecast 

Cost 


Adjustment 


Actual  Index-Forecast  Index 
Actual  Index 


Actual 

Cost 
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It  is  apparent  that  in  a  period  of  rising  economic  indicators,  that  the 
cost  adjustment  fraction  (and  the  resulting  adjustment)  would  be  smaller  if 
the  actual  index  were  used  as  the  denominator.  It  is  intuitively  better  to 
use  such  a  smaller  adjustment  fraction  with  actual  costs  that  already 
include  the  effects  of  inflation.  On  the  other  hand,  for  those  adjustment 
equations  based  on  forecast  costs,  it  appears  more  fair  to  use  the  forecast 
index  as  the  denominator  of  the  adjustment  fraction. 

A  third  decision  facing  the  constructor  of  escalation  clauses  is  the 
choice  of  index  series  and  the  weights  to  apply  to  each  series  used.  For 
example,  one  Air  Force  systems  acquisition  contract  for  aircraft  uses  two 
economic  indicator  series  to  adjust  the  price  for  inflation.  The  two 
series  are  the  BLS,  Wholesale  Price,  Industrial  Commodities  Index  and  a 
wage  rate  series  from  the  BLS  yearly  average  wage  rates  for  the  Durable 
Goods  category  of  Production  Workers  in  Manufacturing  by  Major  Manufacturing 
Groups.  The  Industrials  Commodities  Index  is  chosen  to  represent  the 
materials  portion  of  the  aircraft  price  and  is  weighted  25%  of  the  total 
price.  The  wage  rate  series  is  chosen  to  represent  the  labor  portion  of 
the  price  and  is  weighted  75%  of  the  total  price.  The  redetermination 
clause  is  of  the  form: 


Adjustment  -  Actual  Index-Forecast  Index  x  Forecast 

Forecast  Index  Cost 


The  clause  constructor  needs  to  trade  off  clause  complexity  and  the 
subsequent  difficulty  of  constructing  and  administering  the  clause  against 
level  of  accuracy  desired.  The  above  mentioned  aircraft  procurement  esca¬ 
lation  clause  is  simple  and  easy  to  construct  and  administer.  The 
contracting  parties  take  the  risk,  however,  that  the  economic  indicator 
series  jure  not  closely  related  to  the  airframe  manufacturing  industry. 

There  is  little  consideration  of  indirect  cost  factors  which  often  change 
at  a  different  rate  than  direct  cost  factors. 

The  method  chosen  to  forecast  the  index  trend  is  a  fourth  major  con¬ 
sideration  in  clause  construction.  A  straight  line  regression  analysis 
built  on  price  index  numbers  of  the  1960s  will  seriously  underestimate 
price  indexes  of  the  1970s.  It  is  a  good  idea  to  plot  the  data,  discern  a 
trend  and  best  fit  a  trend  model  through  statistical  analysis  for  projec¬ 
tion  purposes.  A  straight  line  has  not  been  a  good  tool  for  such  projec¬ 
tion  in  recent  years. 

The  ideal  contract  clause  for  adjustment  of  price  due  to  inflation 
does  not  seem  to  exist.  If  the  clause  closely  approximates  the  economic 
changes  of  a  specific  product,  it  is  complicated  to  construct  and  administer 
If  the  clause  is  simple  to  construct  and  administer,  it  does  not  relate  well 
to  the  product  for  which  the  economic  adjustment  is  sought.  Recognizing 
that  all  solutions  are  compromise  solutions,  the  following  ideas  are 
suggested  as  a  starting  place  from  which  to  construct  an  adjustment  clause. 


First,  select  the  normalcy  band  adjustment  clause  form.  For  example: 


I 

■  High  Side  Adjustment 


Actual  Index-High  Side  Forecast  Index  Forecast 
High  Side  Forecast  Index  x  Cost 


This  adjustment  form  uses  a  forecast  index  and  a  forecast  cost  (target  cost) 
By  using  a  forecast  cost  as  a  basis  for  adjustment,  the  administrative  pro- 
|  blems  associated  with  determining  allowability  of  actual  costs  are  avoided. 

Second,  select  at  least  three  index  series  to  represent  the  price  of 
the  product.  One  series  should  be  related  to  the  materials  used  in  or 
purchased  for  the  product  being  priced.  The  second  series  should  relate  to 
direct  labor  of  the  type  used  to  produce  the  product.  The  third  series 
should  relate  to  the  indirect  costs  on  the  product  being  priced.  Heights 
|  totaling  100%  need  to  be  assigned  to  each  category  of  costs.  After 

selecting  the  index  series,  each  needs  to  be  forecast  for  the  period  of 
performance.  For  short  range  forecasts  (less  them  2  years)  an  exponential 
smoothing  model  best  fitting  each  set  of  data  should  be  used.  For  long 
range  forecasts  (2-10  years)  a  trend  model  best  fitting  each  set  of  data 
should  be  used.  Generally  speaking,  one  should  use  more  than  10  years  of 
I  data  to  discern  this  trend. 


Third,  the  forecasted  index  numbers  should  be  used  to  inflate  the  out- 
year  price  estimate  from  current  dollars  to  future  dollars  for  target 
pricing  purposes.  This  task  is  necessary  whether  or  not  an  economic 
adjustment  clause  is  included  in  the  contract. 

Fourth,  a  width  of  the  normalcy  band  for  each  series  of  data  must  be 
defined.  A  zero  width  band  means  that  the  Government  assumes  all  the  risk 
of  inflation.  Rather  arbitrarily,  a  normalcy  band  of  +  one  standard  error 
of  the  mean  is  suggested  as  a  negotiation  starting  point  for  a  normalcy 
band.  This  affords  the  contractor  protection  against  "abnormal"  inflation 
and  the  Government  still  does  not  assume  all  the  risk.  Each  economic 
series  would  have  a^sormalcv  band.  If  actual  index  numbers  move  outside 
the  normalcy  band  for  the  period  being  priced,  the  contract  should  provide 
that  the  price  of  the  affected  element  (materials,  labor  or  indirect)  would 
be  adjusted  in  accordance  with  the  formulation.  Such  a  clause  should  con¬ 
tain  as  a  minimum;  (1)  the  formulation  of  the  repricing  arrangement,  (2) 
the  index  series  source  to  be  used  to  determine  the  actual  indexes  when 
repricing,  (3)  an  example  of  how  the  two  parties  expect  the  repricing  for¬ 
mula  to  work  if  it  is  exercised. 


Price  Deflators  to  Facilitate  Regression  Analysis 

Price  deflators  can  be  used  to  deflate  prices  to  a  constant  dollar 
basis  to  facilitate  regression  analysis.  Many  times  cost  data  are  aggre¬ 
gated  by  function  or  category  of  incurrence  for  accounting  purposes.  These 
data  are  also  related  to  the  flow  of  time  because  of  the  practice  of  keep¬ 
ing  accounts  by  accounting  period.  An  example  of  such  an  account  would  be 
a  general  and  administrative  (G&A)  account,  one  of  the  contractors  indir¬ 
ect  cost  pools.  G&A  pool  data  are  usually  available  on  a  yearly  basis 


as  far  back  as  the  contractor  keeps  records.  When  the  analyst  needs  to  pre¬ 
dict  costs  such  as  G&A  costs  for  future  years  of  operation,  he  needs  to  find 
some  level  of  activity  indicator  that  causes  G&A  costs  to  vary.  An  example 
of  such  an  indicator  might  be  the  number  of  direct  employees,  number  of 
total  employees ,  cost  of  sales  or  other  similar  independent  variables. 

Before  one  begins  the  regression  analysis  of  the  dependent  variable 
(pool  dollars)  on  the  independent  variable(s)  (level  of  activity  indicator), 
the  dollars  need  to  be  deflated  to  a  constant  dollar  basis.  This  is  because 
the  dollars  were  actual  dollars  expended  in  different  periods  of  time.  These 
dollars  include  the  effects  of  inflation  and  they  have  different  values. 

By  selecting  a  price  index  number  series  that  represents  the  kinds  of 
costs  that  are  in  the  aggregation,  and  deflating  each  dollar  grouping  by 
its  corresponding  index  number,  the  effects  of  inflation  can  be  removed. 

The  dollars  are  converted  to  a  constant  (base  year)  basis  suitable  for 
regression  analysis,  forecasting  and  subsequent  reinflation  for  pricing 
purposes.  The  following  example  is  intended  to  illustrate  the  use  of  index 
numbers  when  forecasting  manufacturing  overhead  costs  for  a  future  year  of 
contractor  operations.  Table  3-2  is  a  summary  of  cost  data  keyed  to  the 
explanatory  notes  that  follow. 


Table  3-2 


COST  DATA  SUMMARY 


Year  Number 

1 

2 

3 

4 

5 

6 

Year 

1967 

1968 

1969 

1970 

1971 

1972 

Actual  Cost® 

$32,670 

$41,111 

$43,879 

$51,276 

$59,432 

Activity  Level 

Direct  Labor 

Hours  (DLH)b 

17,800 

21,000 

22,800 

27,100 

35,900 

Index  Numbersc 

100.0 

104.8 

108.9 

112.4 

114.8 

Deflated 

COStrf* 

$32,670 

$39,228 

$40,293 

$45,619 

$51,170 

Activity  Level 

Forecast  (DLH)e 

35,000 

Deflated  Over- 

Deflated  OH  $  Cost  *  A  + 

B  (DLH) 

head  Cost  Model 

&  Deflated 

-  17,688  +  0.96734  (DLH) 

Forecast? 

51,544 

Index  Nr.  Cost 

Index  No.  ■  A  + 

B  (year  number) 

Model  &  Forecast? 

-  97  + 

3.72  (year  number) 

119.3 

Actual  Cost 
Forecast*1 


61,492 


TABLE  2  -  Continued 


MOTES 


a  Actual  costs  for  past  years  of  operation  are  collected  from  the 
contractor's  accounting  records.  These  costs  may  need  to  be  adjusted  to 
compensate  for  changes  in  the  accounting  system  from  year  to  year. 

^The  activity  level  selected  to  explain  the  variation  in  overhead  costs 
from  year  to  year  is  direct  labor  hours  (DLH) .  Historical  DLHs  are 
collected  for  years  corresponding  to  the  overhead  cost  data. 

cThe  index  numbers  listed  here  were  constructed  from  contractor's  indirect 
cost  data  for  prior  years  using  the  weighted  average  of  Price  Relatives 
(current  year  weights)  approach.  The  analyst  might  also  have  selected  a 
previously  constructed  index  series  that  was  representative  of  the 
contractor's  overhead  cost  pool  make-up. 

^The  actual  costs  are  deflated  to  constant  1967  dollars  by  dividing  each 
year's  overhead  costs  by  the  corresponding  index  number. 

eThe  forecast  of  direct  labor  hours  for  the  future  year(s)  is  a  function  of 
the  contractor's  sales  projection.  Necessarily  then,  the  contractor  must 
assist  the  analyst  with  the  forecast. 

^The  deflated  overhead  cost  model  is  a  simple  straight  line  derived  by 
regressing  deflated  costs  on  the  corresponding  activity  level  (DLH) .  The 
deflated  overhead  cost  forecast  is  calculated  from  the  cost  model  by 
inputting  the  forecast  direct  labor  hours,  35,000. 

®The  index  number  cost  model  is  a  straight  line  derived  by  regressing  the 
index  number  of  the  corresponding  year  number  (1,2, 3, 4  and  5).  The  index 
number  forecast  ms  calculated  from  the  cost  model  by  inputting  year  6. 

^The  actual  cost  forecast  is  calculated  by  multiplying  (inflating)  the 
forecast  deflated  overhead  costs  51,544  by  the  forecast  index  number,  119.3 

3-2  TIME-SHARING  SERVICES  AMD  COPPER  IMPACT* 

Computers  and  Pricing 

By  its  very  nature,  the  contract  pricing  and  financial  analysis  process 
involves  two  major  factors:  first,  the  informed  judgment  of  the  analyst 
based  on  experience  and  training)  and  second,  a  considerable  amount  of  compu 
tation.  A  comprehensive  study  of  the  pricing  function  in  1972  revealed  that 
a  disproportionate  amount  of  the  professional  price  analyst's  time  was  being 
spent  in  computations  and  related  mechanical  and  administrative  tasks,  leav¬ 
ing  little  or  no  time  for  the  analyst  to  thoughtfully  probe  all  issues  in  a 
cost  or  price  proposal.  The  principal  computational  tool  was  the  standard 
mechanical  calculator. 

*Sandman,  Tom.  Contracting  Applications  of  Copper  Impact,  Department  of 
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The  obvious  candidate  to  assist  the  analyst  with  his  computation  task  was  the 
digital  computer •  This  offers  several  features  of  interest  to  the  analyst: 

-  Accuracy;  the  computer  in  its  present  state  rarely  makes  computa¬ 
tional  errors  which  are  not  based  on  improper  programming  or  faulty  data. 

-  Speed;  once  a  job  enters  the  computer,  it  is  capable  of  easily 
doing  computations  in  a  few  seconds  which  would  take  hours  manually. 

-  Flexibility;  once  a  program  is  established,  any  number  of  data  sets 
may  be  processed  in  an  identical  manner  or,  conversely,  with  a  minor 
program  change,  the  same  data  set  can  be  processed  differently. 

Until  the  late  1960 's,  bringing  the  power  of  the  computer  to  the  field 
analyst  was  difficult  and  time-consuming  since  it  usually  involved  use  of 
someone  else's  program  and  passing  information  and  data  through  the  mail  to 
the  computer  center.  The  advent  of  the  time-sharing  computer  solved  this 
problem  by  providing  a  means  to  gain  on-demand  access  to  the  computer  central 
processor  from  a  remote  terminal  over  telephonic  communication  lines. 

Overview  of  Time-Sharing 

The  time-sharing  computer  is  a  complex  of  devices  which  organize  and 
direct  multiple  access  to  a  single  central  processing  unit.  A  job  control 
computer  processes  several  jobs  simultaneously  in  the  same  central  pro¬ 
cessor  by  efficiently  utilizing  the  computation  space  of  the  central  pro¬ 
cessor.  Through  a  priority  system  and  a  complex  set  of  queuing  logic, 
individual  steps  of  several  jobs  can  be  organized  in  the  central  processor 
so  as  to  make  the  most  efficient  use  of  the  time  available. 

The  other  half  of  the  time-sharing  equation  relates  to  communication 
with  the  central  processor.  Through  the  use  of  communications  terminals 
(teletype  or  CRT)  and  a  network  of  communication  lines,  the  user  of  the 
computer  can  be  located  virtually  an  unlimited  distance  from  the  computer 
itself.  From  the  user's  standpoint,  however,  the  computer  might  as  well  be 
in  his  off:  :e  since  all  the  computational  power  is  delivered  to  him  via  the 
terminal  and  a  set  of  very  specific  words  called  system  commands. 

Overview  of  Applications 

The  focus  of  this  section  is  to  catalog  and  describe  the  myriad  of 
applications  of  the  time-sharing  computer  to  the  pricing  and  financial 
management  task  in  the  Air  Force.  The  applications  generally  fall  into 
five  major  categories,  discussed  in  the  following  paragraphs: 

a.  Cost  Models 


Cost  proposal  logic  simulation  is  a  more  appropriate  phrase  for 
the  process  of  simulating  with  a  computer  program  the  logical  build-up 
and  cost  relationships  of  a  specific  cost  proposal.  Given  a  model  of  a 
contractor  proposal,  the  analyst  can  very  quickly: 


3-11 


-  Arithmetically  audit  a  proposal 


-  Compute  his  own  negotiation  objective  based  on  the  results  of 
his  analysis. 

-  Recompute  his  position  during  negotiations. 

-  Generate  neatly  printed  and  formatted  outputs  of  the  traditional 
"spread-sheet"  display  of  cost  proposal  analysis  and  results  (usually  in 
tabular  form) . 

Another  reason  for  cost  models  is  the  performance  of  "sensitivity"  analy¬ 
sis,  or  the  "what-if"  game.  The  analyst,  through  a  series  of  model  runs, 
can  determine  which  basic  cost  elements  drive  the  most  significant  changes 
in  bottom-line  price  features. 

b.  Overhead  Management 

Several  years  ago,  the  Air  Force  developed  PIECOST,  which  was  an 
early  initiative  in  automated  overhead  negotiation  and  tracking  procedures. 
Basically,  PIECOST  extrapolated  prospective  pricing  data  from  history  using 
various  statistical  techniques.  Since  those  earlier  days,  computer  appli¬ 
cations  in  the  overhead  area  have  evolved  into  sophisticated  cost 
accounting  system  simulation  models.  The  old  PIECOST  acronym  has  been 
replaced  by  "MODE",  referring  to  Management  of  Overhead  Discrete 
Evaluation.  In  essence,  these  models  provide  the  Air  Force  Plant 
Representative  Offices  with  greater  visibility  into  indirect  cost  build-up 
from  incurrence  to  final  rate  development.  These  models  enhance  the  nego¬ 
tiation  of  forward  pricing  rates  by  performing  the  time-consuming  manipula¬ 
tion  of  discrete  cost  inputs.  The  advantages  of  such  models  are 
essentially  those  of  the  cost  models  discussed  previously. 

c.  Workload  Management 

This  application  area  is  largely  self-explanatory.  The  basic 
approach  is  to  automate  the  pricing  case  register  in  such  a  way  as  to 
satisfy  the  following  information  needs: 

-  Local  management  visibility  of  case  load  to  facilitate  resource 
allocation  decisions. 

-  Internal  management  information  for  overall  management  decisions. 

-  Higher  headquarters  workload  information  for  policy  decisions 
and  surveys. 

d.  Data  Banks 


The  computer  has  long  acted  as  an  excellent  medium  for  the  storing 
and  retrieval  of  information.  The  time-sharing  computer  permits  a  wide 
range  of  possibilities  for  centrally  storing  and  maintaining  information  to 
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be  used  at  widely  dispersed  locations*  the  primary  application  in  COPPER 
IMPACT  is  the  centralized  pricing  rate  and  factor  data  banks ,  CONRATES, 
discussed  elsewhere  in  this  paper. 


e.  Analysis  Aids 

Numerous  individual  programs  have  been  developed  under  COPPER 
IMPACT  whose  sole  purpose  is  to  aid  the  price/financial  analyst  in  per¬ 
forming  in-depth  analyses.  In  addition,  the  contractor's  software  library 
contains  numerous  programs  of  interest  to  pricing  and  finance  personnel. 
These  programs  are  available  to  any  user  through  the  Copper  Impact  Library. 

Copper  Impact  Applications 

Introduction 


A  broad  overview  of  the  nature  of  computer  applications  to  contract 
pricing  was  presented  in  the  previous  section.  This  section  includes  addi¬ 
tional  detail  which  will  form  the  foundation  upon  which  a  user  may  build 
his  individual  application. 

This  section  is  not  a  user  guide  to  each  program,  but  rather  a 
reference  guide  to  application  concepts  and  known  programs  which  have  been 
developed  to  date.  Further  detail  on  the  specifics  of  using  programs  in 
this  section  should  be  obtained  from  the  responsible  organization  noted 
with  each  given  program  and/or  system. 

Cost  Proposal  Simulation  Models 


a.  Overview  of  Modeling 

The  purpose  of  a  cost  proposal  simulation  model  is  to  provide  the 
price  analyst  with  a  computational  tool  of  considerable  worth  in  reducing 
the  mechanical  tasks  related  to  cost/price  analysis.  Such  a  model  provides 
a  tool  to  audit  a  proposal,  develop  a  position,  recompute  a  position,  or 
conduct  sensitivity  analysis.  These  tasks  are  accomplished  quickly  and 
accurately  by  the  computer  which  also  generates  a  neatly  typed  output  tabu¬ 
lation  suitable  for  direct  inclusion  in  a  formal  report  or  case  file 
without  being  retyped. 

Cost  models  are  built  for  a  specific  type  of  proposal  format  from 
a  given  contractor.  The  model  itself  is  structured  to  accurately  reflect 
the  arithmetic  relationships  of  basic  inputs  (direct  cost  amounts,  rates, 
and  factors)  and  their  build-up  to  the  final  proposal  amount.  The  model 
usually  provides  for  an  output  report  which  displays  the  three  standard 
elements,  proposed,  objective  and  negotiated.  Figure  3-3  shows  a  typical 
model  structure  and  flow  of  information. 

Developing  a  cost  proposal  model  poses  several  questions  which 
need  answers  prior  to  beginning  the  actual  task  of  programming  the  model. 
The  type  of  considerations  to  be  made  include  the  following: 


-  A  complete,  detailed  understanding  and  working  knowledge  of 
the  contractor's  estimating  and  proposal  system  is  essential.  An  important 
element  is  the  identification  of  those  costs  which  must  be  direct  inputs 
versus  those  which  are  derived  from  rate  or  factor  application. 

-  Are  the  contractor's  proposals  sufficiently  similar  in  format 
and  logic  that  one  or  two  models  can  be  developed  which  handle  all  proposal 
types?  Alternatively,  must  a  separate  model  be  constructed  for  each  proposal 
type? 


Figure  3-3 
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-  Are  rate  and  factor  structures  sufficiently  general  that  a 
single  rate  table  can  be  used  for  several  different  proposal  types? 

-  Is  a  single  output  report  format  satisfactory  or  are 
multiple  report  options  desirable?  Are  time-period  spreads  of  data 
necessary? 

Cost  models  have  proven  very  useful  in  all  aspects  of  the  pricing 
process,  and  savings  have  been  documented  at  from  3:1  to  10:1  in  terms  of 
manhours  consumed  in  pricing  computations  and  report  generation. 

b.  Proposal  Pricing  System  (PPS) 

COPPER  IMPACT  has  sponsored  the  development  of  two  cost  model 
building  systems  which  permit  the  user  vithout  knowledge  of  a  programming 
language  to  construct  cost  models  for  specific  proposal  types. 

One  of  these  systems  is  called  the  Proposal  Pricing  System  (PPS). 
The  system  is  fully  described  in  the  PPS  User  Guide  available  from 
Headquarters  AFSC/PMMP,  Andrews  AFB,  DC  20334.  The  system  relies  on  a 
set  of  codes  as  inputs  to  describe  the  model  for  a  model  building  program. 
Input  data  for  a  given  proposal  is  input  from  a  second  file,  processed,  and 
the  results  put  in  a  master  data  file.  Reports  are  generated  with  either 
the  standard  system  report  modules  or  specialized  report  routines  prepared 
by  the  user.  Figure  3-4  provides  an  overview  of  the  flow  of  information  in 
the  PPS  System. 

The  principal  utility  of  PPS  is  in  reducing  the  amount  of  time 
required  to  develop  a  cost  model  using  a  programming  language  such  as  BASIC 
or  FORTRAN.  Savings  in  this  area  range  from  6:1  to  10:1.  These  savings 
are,  however,  offset  by  the  cost  of  running  each  case  through  the  model. 
Therefore,  as  a  rule,  if  a  model  is  to  be  used  frequently  throughout  the 
year,  the  model  should  be  specifically  programmed  in  a  language  (BASIC  or 
FORTRAN)  rather  than  using  PPS. 

c.  Programmable  Cost  Model 

The  Programmable  Cost  Model  is  identical  in  purpose  to  the  Proposal 
Pricing  System,  but  relies  primarily  on  an  approach  in  which  the  operator 
interacts  conversationally  with  the  program  when  defining  the  model  and 
providing  data  input.  Specific  information  and  detailed  user  documentation 
cam  be  obtained  through  Headquarters  AFLC/PPPP,  Wright-Patterson  AFB,  Ohio 
45433* 

d  Specialized  Cost  Models 

Many  specialized  cost  models  have  been  programmed  by  COPPER  IMPACT 
users  to  apply  to  specific  cost  proposal  formats  from  specific  contractors. 
These  models  are  available  to  any  user  through  the  COPPER  IMPACT  Library. 
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Financial  Analysis  Lan 
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FAlall  is  a  contractor-provided  software  package  designed  to  pro¬ 
vide  tabular  reports  of  a  financial  analysis  nature.  The  language  uses 
simple  English- like  statements  and  does  not  require  a  high  degree  of  exper¬ 
tise  in  programming  in  order  to  be  used  effectively.  By  using  a  series  of 
rows  and  columns/  the  user  can  establish  a  program  which  processes  data  in 
numerous  ways.  The  system  has  been  used  effectively  in  establishing  cost 
proposal  models,  but  has  numerous  other  applications  such  as  time-phased 
cash  flow  analysis  and  budget  development.  As  with  most  other  software 
packages  of  this  type,  the  overhead  involved  in  making  the  system  easy  to 
use  and  quick  to  implement  also  makes  it  somewhat  expensive.  A  user  should 
be  prepared  to  trade  operating  cost  for  development  time  in  using  FALXI. 

Overhead  Management 

a.  Overview 

Since  the  late  1960's,  the  DOD  procurement  community  has  been 
placing  increasing  emphasis  on  the  management  and  monitorship  of  indirect 
cost.  Organizations  devoted  to  overhead  management  have  been  established 
which  utilize  advanced  computer  applications  to  enhance  their  cost  analysis 
and  negotiation  functions.  A  prime  example  of  this  type  of  organization  is 
the  Business  Management  Branch  within  an  Air  Force  Plant  Representative 
Office  (AFPRO). 

b.  PIECOST 

One  of  the  original  applications  of  the  time-sharing  computer  to 
the  overhead  cost  analysis  process  was  an  approach  known  as  PIECOST 
(Probability  of  Incurring  Estimated  Cost).  The  basic  premise  of  PIECOST 
was  that  indirect  expense,  when  properly  categorized,  could  be  forecasted 
on  the  basis  of  historical  extrapolation  of  pricing  data  through  the  use  of 
correlation  and  regression  analysis.  Experience  has  shown  that  while  the 
concept  may  be  valid,  the  forecasting  technique  and  indeed  the  bulk  of  the 
approach  proved  to  be  of  limited  use  in  negotiating  actual  rate  agreements 
with  contractors.  This  is  not  to  say  that  statistical  techniques  are 
inappropriate  in  the  contract  pricing  function.  On  the  contrary,  they  are 
used  by  the  Air  Force  when  it  is  practical  to  do  so. 

The  primary  benefit  of  PIECOST  was  that  it  provided  a  philosophi¬ 
cal  framework  from  which  the  current  computer  applications  were  developed. 
Some  of  the  computer  programs  used  within  the  PIECOST  application  package 
are  still  used  as  cost  tracking  aids.  Specific  details  on  their  use  can  be 
obtained  from  the  Air  Force  Contract  Management  Division  (AFCMD/TMO, 
Kirtland  AFB,  M  87117). 


MODE  (Mana 


nt  of  Overhead  Discrete  Evaluation 


The  MODE  model  is  the  principal  computer  aid  used  by  the  AFPRO 
overhead  cost  analyst.  Each  MODE  model  is  tailored  to  simulate  a 


particular  contractor's  cost  accounting  system.  MOTE  provides  greater 
visibility  of  overhead  cost  flow  from  incurrence  to  final  rate  development. 
These  models  have  been  implemented  at  most  APPRO  locations  where  they  are 
used  primarily  for  negotiation  of  Forward  Pricing  Rate  Agreements  or 
Recommendations  ( FPRA  or  FPRR) .  Detailed  information  concerning  the  imple¬ 
mentation  and  use  of  these  models  can  be  obtained  from  AFCMD/TMO. 

d.  SORT  (Settled  Overhead  Retrieval  Technique) 

The  continued  concern  at  OSD  and  Service  level  about  consistency 
in  the  way  overhead  is  managed  prompted  AFCMD/TMO  to  develop  a  computerized 
technique  to  maintain  a  track  on  the  treatment  of  overhead  cost  items  in 
final  settlement.  The  purpose  of  SORT  is  to  provide  management  and  the  ACO 
with  data  to  show  how  overhead  costs  are  being  treated  by  various  APPRO 
locations. 

When  fully  implemented,  the  SORT  data  base  will  contain  final 
settlement  findings  for  each  AFCMD  location  for  a  three-year  period.  Zn 
brief,  the  validated  user  will  be  able  to  make  inquiries  of  SORT  on  the 
basis  of  key  word  descriptors  or  DAR  cost  principles.  These  key  words  or 
DAR  principles  will  relate  to  various  overhead  final  settlement  findings. 
SORT  will  respond  by  providing  a  printout  of  information  pertaining  to  each 
finding  relative  to  the  particular  inquiry. 

Management  Information  Systems 

a.  Overview  of  Management  Information  Systems 

The  advent  of  the  computer  has  made  management  and  detailed  analy¬ 
sis  of  large  amounts  of  data  more  feasible  and,  in  turn,  more  fruitful.  A 
properly  organized  automated  data  base  is  in  many  cases  the  only  way  to 
convert  otherwise  massive  amounts  of  data  into  valuable  management  infor¬ 
mation. 

b.  AFCMD  Pricing  Management  Information  System 

An  information  data  base  in  terms  of  workload  management  is  the 
key  element  in  the  feedback  loop  to  management.  By  structuring  inputs  and 
storage  rules  based  output  needed  by  management,  an  eventually  indispen¬ 
sable  tool  can  be  created.  Figure  3-5  is  a  schematic  diagram  of  a  typical 
management  control  system  which  incorporates  an  automated  feedback  loop. 

• 

Until  1976,  automated  workload  management  systems  under  COPPER 
IMPACT  were  developed  purely  on  a  local  basis  by  individual  user  organiza¬ 
tions.  These  systems  utilize  a  variety  of  approaches  to  the  problems  of 
output  requirements,  required  inputs  and  information  processing  methodo¬ 
logy.  Based  on  a  discerned  need  at  the  headquarters  level  for  enhanced 
pricing  workload  data,  the  AFCMD  Pricing  Division  developed  a  centralized 
pricing  workload  data  base  in  1975.  This  Pricing  Management  Information 
System  (PMIS)  will  serve  the  information  needs  of  both  field  pricing  chiefs 
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in  local  resource  management  and  headquarters  personnel  in  pricing  policy 
development*  The  data  base  design  anticipates  application  to  both  AFPRO 
and  buying  activity  pricing  offices/  and  incorporates  optional  data  to 
record  manhour  data  for  each  pricing  case*  Detailed  information  and  user 
guides  can  be  obtained  from  the  Air  Force  Contract  Management  Division  / 

Pricing  Division  (AFCMD/TMF,  Kirtland  APB/  tM  87117). 

c*  Contractor  Rates  System 

Description*  The  Contractor  Rates  System  (CONRATES)  is  an  automated 
data  bank  designed  to  contain  rates  and  factors  used  by  price  analysts  and 

buyers  in  the  task  of  forward  pricing  of  contracts.  The  data  bank  was  not 

intended  to  contain  historical  rates  and/or  factors. 

CONRATES  includes  data  on  a  contractor-by-contractor  basis  for 
each  of  up  to  five  different  categories:  direct  labor  rates;  overhead 

rates;  direct  cost  factors;  contract  performance  information;  and  factors 

used  in  pricing  technical  data.  At  the  present  time/  emphasis  is  being 
placed  on  use  of  the  first  three  sections  of  the  data  bank. 

Each  section  of  CONRATES  for  a  particular  contractor  can  contain 
up  to  five  years  of  data  in  any  of  four  forms:  monthly,  quarterly,  semian¬ 
nual,  and  annual.  The  system  provides  for  two  types  of  rates  to  be 
included;  these  are  ACO  recommended  or  negotiated.  There  is  no  practical 
limitation  on  the  number  of  rate  categories  included  in  each  section. 

Responsibility  for  input  and  maintenance  of  rates  and  factors  for 
a  particular  contractor  rests  with  the  cognisant  Principal  Administrative 
Contracting  Officer  (PACO).  As  a  practical  matter,  data  will  only  be  input 
for  the  locations  having  a  resident  contract  administration  office.  All 
such  locations  are  encouraged  to  participate  in  the  data  bank,  but  emphasis 
is  placed  >n  those  locations  having  a  significant  volume  of  pricing  activity 
with  more  than  one  buying  activity. 

Access  and  Security.  Access  to  CONRATES  is  restricted  at  several 
levels.  First,  the  organization  desiring  data  bank  access  must  have  or  have 
access  to  a  valid  COPPER  IMPACT  user  number /password  combination.  Second, 
the  user  must  know  the  five-digit  contractor  identification  code.  Third, 
the  user  number  of  the  organization  must  have  been  granted  authorization  by 
the  PACO  based  on  a  justified  need. 

Where  established  communication  channels  are  available,  telephonic 
requests  to  the  PACO  are  usually  sufficient  to  gain  the  necessary  approvals . 

In  any  case,  a  written  request  for  approval  should  be  provided  to  the  PACO 
for  his  files  and  the  user's  own  protection  in  the  unlikely  case  of  compro¬ 
mise  of  contractors'  financial  data. 

\ 

Using  CONRATES.  From  the  user's  standpoint,  OONRATES  is  compara¬ 
tively  simple  to  use.  Only  two  main  programs  are  involved;  the  first, 

'LISTCLC ' ,  generates  an  index  to  the  particular  contractor/ section  of  index 
and  the  second,  ' CONRATES ' ,  is  the  main  data  retrieval  and  maintenance  program. 
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The  primary  identifier  of  data,  after  the  contractor  identification 
(CIO) ,  is  the  category/level  code  (CLC).  ' LISTCLC'  generates  the  numeric 
CLCs  necessary  to  specify  a  report,  the  associated  nomenclature,  rate  type, 
and  period  of  data  loaded.  Once  a  user  has  established  the  CLCs  of  interest, 
'CONRATES '  is  run  in  the  REPORT  or  SELECT  mode.  The  REPORT  mode  generates  a 
neatly  formatted  report,  complete  with  contractor  identification  data,  con¬ 
taining  the  rates,  application  base  definitions  and  any  remarks  applicable 
to  each  of  the  CLCs  requested.  The  SELECT  mode  permits  the  user  to  specify 
a  given  CLC  and  time  period(s)  of  interest  which  are  returned  without  any 
special  report  format.  A  single  rate  for  one  time  period  may  be  retrieved 
in  the  SELECT  mode. 

Documentation .  Contract  administration  organizations  may  request 
the  CONRATES  Input  and  Maintenance  Guide  from  Headquarters  AFCMD/TMF, 

Kirtland  AFB,  MM  87117.  A  companion  User's  Guide  which  provides  instruc¬ 
tions  on  accessing  CONRATES  may  be  obtained  from  Headquarters  AFSC/PMMP, 
Andrews  AFB,  DC  20334. 

d.  Escalation  Data  Files 


Two  data  files  are  maintained  with  monthly  update  on  the  COPPER 
IMPACT  master  library  which  contain  information  useful  in  surveillance  of 
Economic  Price  Adjustment  (EPA)  changes.  The  files  are  described  here  for  your 
reference.  Additional  indexes  may  be  included  on  request  to  AFSC/PMMP, 

Andrew  AFB,  DC  20334. 

Wholesale  Price  Index  Data  File  (WPINDEX) . 

The  WPINDEX  file  contains  the  following  indexes  by  month  for  up 
to  seven  years  of  history: 

( 1 )  GNP  Implicit  Price  Deflator 

(2)  Consumer  Price  Index 

(3)  Wholesale  Price  Index  (All  Commodities). 

(4)  Wholesale  Price  Index  (Industrial  Commodities) 

(5)  Selected  detailed  indexes  based  on  those  used  in  AFSC  EPA  clauses. 


Hourly  Earnings  Rate  Data  ( HBRATES ) .  The  HE  RATES  file  includes  the 
average  gross  hourly  earning  rates  (including  overtime)  in  the  following  cate¬ 
gories  (Standard  Industrial  Classifications)  for  \q>  to  seven  years  of  history: 


( 1 )  Total  private 

(2)  Durable  Goods 


\ 


(3)  Selected  SIC  codes  presently  used  in  AFSC  EPA  clauses 
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e.  Data  Man* 


ni 


it  Syatam  (DMS) 


The  Data  Management  System  Is  a  contractor-provided  software  package 
designed  to  provide  a  flexible  tool  for  use  by  nonprogrammers  in  developing 
and  managing  a  data  base  of  any  kind,  the  system  employs  a  series  of 
Bnglish-like  statements  which  permit  the  user  to  establish  and  maintain  a 
lavge  data  base.  Reports  are  generated  quickly  and  easily  and  nay  be  deve¬ 
loped  for  production  use  or  one-time  ad  hoc  inquiries  to  the  data  base. 

Analysis  Aids 

Numerous  individual  programs  have  been  developed  under  COPPER 
IMPACT  whose  sole  purpose  is  to  aid  the  price/ financial  analyst  in  per¬ 
forming  in-depth  analyses.  In  addition,  the  contractor's  software  library 
contains  numerous  programs  of  interest  to  pricing  and  finance  personnel. 
These  programs  are  available  to  any  user  through  the  Copper  Impact  Library. 


3-3  USES  OF  THE  LEARNING  CURVE  IN  CONTRACT  ADMINISTRATION* 

Within  the  contract  administration  phase  of  the  contracting  process, 
there  are  several  uses  of  the  learning  curve  concept :  ( 1 )  planning  and 

scheduling  of  work  to  be  accomplished  by  a  contractor,  (2)  pricing  of 
replenishment  spare  parts,  (3)  evaluating  value  engineering  changes,  and 
(4)  pricing  engineering  change  proposals.  Bach  of  these  applications  is 
discussed  below. 

Planning  and  Scheduling 

The  learning  curve  can  approximate  the  number  of  man  hours  that  will 
be  required  to  produce  a  specific  number  of  items.  If  the  units  are  pro¬ 
duced  on  a  job  lot  basis,  planners  can  also  compute  how  many  man  hours 
should  be  scheduled  for  a  given  period.  For  example,  a  contractor 
experienced  an  80  percent  learning  curve  for  an  item  he  was  producing  with 
a  first  unit  "cost"  of  1,000  man  hours.  He  provided  the  following  produc¬ 
tion  delivery  schedule  to  the  government  indicating  the  man  hours  to  be 
expended  under  the  contract  (Table  3-3). 


Table  3-3 

Contractor's  Estimated  Man-Hour  Reguirements  and  Delivery  Schedule 


Delivery 

date 

Units  to  be 
delivered 

Planned  man¬ 
hour  expenditure  * 

Total  cumula¬ 
tive  manhours 
planned 

3  June 

10 

6,315 

6,315 

17  June 

10 

4,170 

10,485 

1  July 

10 

3,000 

13,485 

15  July 

10 

2,800 

16,285 

29  July 

10 

2,500 

18,785 

*AFIT.  Contract  Administration.  Volume  II,  Chapter  26,  "Use  of  the 
Learning  Curve  in  Contract  Administration".  1975. 


These  data  were  plotted  with  a  dotted  line  (scheduled  nan  hours)  on 
log- log  paper  in  Figure  3-6.  The  analyst  then  plotted  the  cumulative 
average  man  hours  required  assuming  an  80  percent  learning  curve. 
Examination  of  the  data  revealed  that  the  planned  man  hours  would  not  meet 
the  delivery  schedule  required  by  the  contract.  The  government's  admi¬ 
nistrative  contracting  officer  (AGO)  was  advised  that  the  contractor  would 
likely  be  five  units  short  by  the  final  delivery  date  unless  there  was  a 
rescheduling  of  manpower.  The  analyst  suggested  to  the  AGO  that  the  sche¬ 
dule  reflected  in  Table  3-4  was  more  adequate  in  terms  of  manpower .  unless 
the  contractor  increased  the  planned  man  hour  expenditure,  he  would  be 
unable  to  meet  his  delivery  schedule.  The  ACO  informed  the  contractor  of 
the  discrepancy  and  requested  a  more  realistic  scheduling  to  prevent 
default  of  the  contract. 


Table  3-4 

Government's  Revised  Estimated  Man-hour 
Requirements  and  Delivery  Schedule 


Delivery 

date 

Units  to  be 
delivered 

Planned  man¬ 
hour  expenditure 

Total  cumula¬ 
tive  manhours 
planned 

3  June 

10 

6,315 

6,315 

17  June 

10 

4, 170 

10,485 

1  July 

10 

3,535 

14,020 

15  July 

10 

3,173 

17,193 

29  July 

10 

2,929 

20,122 

Pricing  Spare  Parts 

Direct  labor  is  one  of  the  major  costs  in  most  production  contracts. 

The  learning  curve  is  used  for  the  purpose  of  estimating  a  few  high  cost 
items  produced  over  a  long  period  of  time  and  requiring  a  large  amount  of 
labor  expended  per  unit.  Suppose  a  contractor  submitted  the  following  unit 
price  proposal  to  the  ACO  for  the  procurement  of  50  units  of  a  spare  part: 


Direct  material  (2,000  lbs  4  $5.00  per  lb)  ....  $10,000 

Direct  labor  (500  hrs  4  $20  per  hr) . 10,000 

Manufacturing  overhead  (4  100%) . 10,000 

30,000 

General  and  Administrative  expenses  (45%)  .  .  .  .  1 ,500 

31,500 

Profit  (4  10%) .  3,150 

Unit  Price  Proposed  $34,650 


Suppose  that  the  contractor  attempted  to  justify  his  proposal  on  the  basis 
of  an  80  percent  learning  curve.  If  so,  the  cumulative  average  labor 
requirement  for  50  units  would  be  402.4  man  hours.  At  $20  per  man  hour, 
the  contractor  should  have  quoted  $’dJ48  for  direct  labor.  If  all  other 
factors  remain  the  same,  the  contractor's  proposal  might  be  revised  to 
reflect  the  following  government  negotiation  position: 
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Direct  Materiel  (2,000  lbs  C  95.00  per  lb)  .  .  .  .  910,000 


Direct  labor  (402.4  hr*  9  920  per  hr).  . .  8,048 

Manufacturing  overhead  (8  100%)  r- .  8,048 

26,096 

General  and  Administrative  expenses  (8  5%)  ....  1,305 

27,401 

Profit  (9  10%) . . . .  .  .  .  _  2,740 


Obit  Price  Negotiation  Ob j active  930,141 

Note  the  difference  of  94,509  in  the  unit  price  proposed  and  the  unit 
price  negotiation  Objective  suggested  by  the  application  of  the  learning 
curve.  Whether  the  goverixaent  can  achieve  its  negotiation  position  would 
depend  on  the  circumstances  surrounding  the  procurement  in  question.  The 
price  proposal  nay  be  inflexible  if  the  vendor  knows  that  he  is  in  a  sole 
source  position  or  that  the  matter  is  urgent  to  the  government  or  that  some 
other  factor  strengthens  his  bargaining  position. 

Evaluation  of  Value  Engineering  Chances 

A  value  engineering  change  proposal  (VECP)  may  result  from  a  contrac¬ 
tual  provision  which  encourages  the  contractor  to  submit  cost  reduction 
ideas.  Such  proposals  are  reviewed  by  the  ACO  who  makes  a  recommendation 
for  approval  or  disapproval  by  the  principal  contracting  officer  (PCO).  _ 
In  submitting  a  VECP,  the  contractor  must  submit  all  necessary  cost  and 
technical  data  for  evaluation  by  the  government.  Among  the  cost  factors 
to  be  examined  is  direct  manufacturing  labor.  When  circumstances  are  appro¬ 
priate  and  the  necessary  prerequisites  are  met,  the  contractor  may  estimate 
the  man  hours  required  based  on  the  leaning  curve.  A  difference  in  the 
degree  of  slope  may  make  one  part  of  an  item,  which  is  proposed  to  be 
substituted  for  another,  more  economical  and  therefore  more  desirable  in 
considering  a  VECP.  The  following  exaqsle  shows  how  the  leaning  curve 
be  used  in  the  decision  to  accept  a  VECP. 

~ffie  contractor  proposed-to  substitute  part  's  for  pert  *  in  an  end 
item.  Because  part  B  is  described  by  a  70  percent  learning  curve  and  part 
A  by  an  82  percent  curve,  the  contractor  claims  that  part  B  can  be  produced 
less  expensively  than  part  A  even  though  the  cost  of  producing  the  first 
unit  would  be  much  higher.  Suppose  the  first  unit  of  part  A  costs  9350, 
and  the  first  unit  of  B  costs  9500  (sea  Figure  3-7).  The  costs  at  unit  12 
would  be  the  same  for  units  A  and  B,  but  for  successive  units,  part  B  would 
cost  less.  If  the  government  planned  too  buy  400  units,  the  400th  unit  of 
part  A  would  cost  963  in  comparison  to  923  for  part  B.  In  terns  of  total 
costs,  the  difference  is  even  greater.  For  a  procurement  of  400  units, 
part  A  would  cost  916,883  more  than  part  8.  Assuming  technical  accep¬ 
tability,  it  would  be  to  the  advantage  of  the  govenoMnt  to  accept  the  VECP 
provided  that  the  costs  of  implementing  the  change  are  not  large. 

Prlcinc  Engineering  Change  Proposals  -  - .  . 


The  ACO  sostetlmes  negotiates  agreements  incorporating  approved  engi¬ 
neering  change  proposals.  The  initial  step  is  to  measure  the  extent  of  the 
change.  Tor  this  purpose,  the  best  measurement  may  be  the  percentage  of 
changed  effort.  Either  work  standards,  if  they  are  available,  or  engi¬ 
neering  estimates  may  be  used.  Such  estimates  are  based  on  experience  with 
similar  components.  The  nature  of  the  estimating  systams  used  varies 
widely  from  oompeny  to  company.  One  simple  and  crude  estimate  of  the  per¬ 
centage  of  change  is  to  relate  the  number  of  blueprints  to  be  changed  with 
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learning  curve  and  cost  volume  relationship 


the  total  number  of  blueprints  covering  the  entire  item  or  component.  To 
use  the  learning  curve  appropriately,  the  estimator  must  be  able  to  express 
the  engineering  change  proposal  by  a  percentage  in  terms  of  the  total  work 
to  be  done. 

A  phenomenon  of  the  learning  curve  is  this:  when  change  is  intro¬ 
duced,  learning  tends  to  begin  anew.  In  other  words,  if  all  production 
required  new  learning,  the  man  hour  reduction  following  the  first  unit  and 
the  resulting  efficiencies  would  be  lost.  Under  this  concept,  the  amount 
of  labor  required  for  changing  the  first  unit  agrees  in  percentage  with  the 
amount  of  change. 

The  following  example  assumes  an  instance  in  which  one  equal  part  is 
being  substituted  for  another  during  the  production  of  an  end  item.  The 
change  affects  the  units  between  number  300  and  number  500,  a  total  of  200 
units.  The  change  deletes  10  percent  of  the  old  work  and  adds  10  percent 
new  work.  The  procedure,  using  learning  curve  tables,  for  computing  the 
value  in  hours  of  the  changed  portion  of  the  work  is  as  follows: 

First,  consult  the  learning  curve  cumulative  total  tables  for  the  rate 
of  500  units  and  the  rate  of  300  units  on  the  80  percent  learning  curve. 
(Figure  3-8) . 


500  units  =  98.847 
300  units  *  69.663 
200  units  *  29.184 

Second,  apply  the  formula  F  times  R  to  find  the  total  number  of  man 
hours  which  would  be  expended  if  no  change  occurred. 

F  equals  the  man  hour  value  of  1st  unit  =  500  man  hours 

R  equals  the  rate  for  R^qq  minus  RjQn  “  29.184 

500  x  29.184  ■  14,592  manhours  for  200  units 

Third,  apply  the  same  formula  to  the  amount  of  unchanged  work. 

90  percent  X  F  ■  450  man  hours 

450  man  hours  X  29.184  -  13,133  man  hours  for  200  units 

Fourth,  calculate  the  value  in  hours  for  the  changed  portion  of  the 
work.  F  for  the  changed  portion  of  the  work  equals  10  percent  times  500  or 
50  man  hours.  Assume  the  same  slope  for  learning  on  the  changed  portion  of 
the  work  as  for  the  unchanged  portion  of  the  work. 

R  for  200  units  of  the  changed  portion  starting  at  unit  1  ■  52.720 

50  man  hours  X  52.720  =  2,636  maun  hours 

Fifth,  add  the  amount  of  unchanged  work  to  the  amount  of  changed  work 
for  the  total  number  of  man  hours  for  200  units. 

13,133  +  2,636  -  15,769  man  hours 


3-2  7 


Figure  3-8 

Segment  of  a  Learning  Curve  Ral 


80$  Curve 


Unit 

Number 

Cumulative 

Total 

Cumulative 

Average 

Unit 

1 

1 .00000000 

1.0000 0000 

1.00000000 

10 

6.31537300 

0.63153730 

0.47650987 

20 

10.48494279 

0.52424714 

0.38120790 

30 

14.01989341 

0.46732978 

0.33455934 

40 

17.19345570 

0.42983639 

0.30496632 

50 

20.12171364 

0.40243427 

0.28382707 

300 

69.66337918 

0.23221126 

0.15942083 

500 

98.84724744 

0.19769449 

0.13524640 

Extract  from:  The  Experience  Curve  Tables.  Vole.  1  and  2, 
Missile  Command,  Redstone  Arsenal,  Alabama,  1962. 


•  S.  Army 


Sixth,  the  cost  of  the  change  is  the  difference  in  the  total  man  hours 
of  the  changed  procurement  and  the  total  man  hours  of  the  original  procurement 

15,769  -  14,592  -  1,177  added  man  hours  (cost  of  the  change) 


This  example  concerned  the  substitution  of  one  equal  part  for  another 
in  the  end  item.  Other  changes  may  cause  either  an  addition  to  or  a  dele¬ 
tion  from  the  end  item.  If  work  is  added,  the  procedure  is  the  same  except 
that  no  work  is  deleted.  If  the  change  resulted  in  a  deletion,  the  percen¬ 
tage  of  old  work  would  be  reduced,  and  no  new  work  would  be  added. 
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CHAPTER  4 


MAINTENANCE 


Maintenance  is  the  process  employed  to  keep  equipment  in  condition  for 
effective  use  or  to  return  it  to  that  condition  when  it  malfunctions  or 
fails.  It  involves  repair,  overhaul,  servicing,  calibration,  inspection, 
modification,  and  manufacture.  The  Air  Force  delineates  five  primary 
objectives  of  maintenance  (AFR  66-14):  (1)  Develop  and  maintain  the  mili¬ 

tary  and  industrial  capability  needed  to  sustain  Air  Force  operations  under 
all  conditions;  (2)  Insure  organizational  structures  are  designed  on  the 
basis  of  a  wartime  operational  concept  to  provide  an  m-being  maintenance 
support  capability  to  meet  all  operational  requirements;  (3)  Establish  Air 
Force  maintenance  systems  and  techniques  that  will  be  responsive  to 
changing  operational  requirements  and  technology;  (4)  Insure  that  Air  Force 
material  is  serviceable,  operable,  and  configured  to  meet  the  mission 
requirements;  and  (5)  Insure  that  maintenance  planning  begins  in  the  con¬ 
ceptual  phase  of  each  new  system  or  equipment.  The  airlines  define  their 
objectives  for  maintenance  more  succinctly,  and  from  a  different  perspec¬ 
tive,  viz.,  "To  prevent  deterioration  of  the  inherent  design  levels  of 
reliability  and  operating  safety  of  the  aircraft  and  to  accomplish  this 
protection  at  the  minimum  practical  costs." 

In  the  Air  Force,  equipment  maintenance  is  the  largest  facet  of 
logistics  in  terms  of  money,  manpower,  facilities,  and  many  other  important 
resources.  As  such,  maintenance  is  considered  a  major  factor  In  military 
capability. 

The  maintenance  system  may  be  considered  as  a  "production  system"  and  is 
generally  operated  in  parallel  with  the  organization's  primary  production 
system.  Maintenance  management,  therefore,  employs  essentially  the  same 
management  tools  and  techniques  as  found  in  the  production/operations  man¬ 
agement  area.  Because  of  the  complexity  of  many  of  the  problems  found  in 
maintenance,  heuristics  (e.g.,  "Johnson's  Rule"  for  scheduling)  are  used 
extensively.  Simulation  is  another  tool  which  is  finding  wider  application 
m  the  maintenance  area.  The  problems  illustrated  in  this  section  employ 
techniques  which  can  be  found  in  any  comprehensive,  senior  undergraduate  or 
beginning  graduate  level  text  on  production/operations  management. 

4” t  ASSEMBLY  LINE  BALANCING 


An  emergency  7CT0  on  the  F-15  requires  seventeen  tasks  to  complete. 

The  tasks,  designated  by  letter,  their  performance  times,  and  sequence, 
are  listed  in  Table  4-1.  A  fleet  of  72  aircraft  must  be  run  through 
the  TCTO  process.  The  DCM  wants  the  job  completed  in  no  more  than  six 
days.  The  wing  works  two  shifts  per  day,  seven  productive  hours  per  shift. 
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The  DCM  wants  to  set  up  an  assembly  line  type  of  operation.  How  many 
work  stations  should  he  use?  What  tasks  do  they  consist  of? 


Table  4-1 


TASKS,  PERFORMANCE  TIMES,  AND  PRECEDENCE 


TASK 

PERFORMANCE 

TIME  (MIN) 

TASKS  THAT 

MOST  PRECEDE 

A 

10 

— — , 

B 

5 

A 

C 

15 

A,B 

D 

15 

— 

E 

30 

D 

F 

15 

E,F 

6 

45 

A,B,C,D,E/F 

H 

IS 

A,B,C 

1 

20 

D,E,F 

J 

25 

A,B,C,D,E,F,G,H 

K 

10 

D,E,F,  I 

L 

25 

D,B,F,  X 

M 

40 

A|BfCf Df E;F ,G,H, J 

N 

25 

D,E,F , I,K 

O 

30 

D,E ,F, I,L 

P 

50 

A,B,C,D,E,F,G,H,I,J,K,L,M,N,0, 

Q 

15 

A,B,C,D,B,F,G,H,I, J,K,L,M,N,0,P 

390 

This  is  an  assembly  line  balancing  problem.  The  layout  of  the 
assembly  line,  in  terms  of  the  number  of  work  stations  required,  is  based 
on  the  output  required  from  the  line  and  the  desire  for  an  efficient 
operation,  i.e.,  one  that  gets  the  job  done,  on  time,  with  a  minimum 
amount  of  idle  or  wasted  time. 

The  first  problem,  then,  is  to  determine  the  output  required,  or 
demand  (0).  Since  there  are  72  aircraft,  and  they  must  all  be  completed 
in  no  more  than  six  days,  demand  is: 


72  acft 

D  m  m  ^2  acft/day 


Next,  we  aMd  to  know  tho  productive  tut  (P)  por  day  available  to 
perform  tho  TCTO.  This  tiao  1st 

P  »  2  shifts/day  X  7  hours/shift  -  14  hrs/day 

Proa  Table  4-1 ,  tho  total  tlmo  (T)  required  for  the  seventeen  tasks 
in  the  TCTO  is  390  minutes /  or  6.5  hours. 

The  number  of  work  stations  required  (H)  is  then: 

hrs  aeft 

6.5  soft  X  12~~day 

hrs  -  5.57 

14  day 


TXD 


Therefore:  six  work  stations  are  required. 

The  cycle  time  (C)  required  to  atet  the  demand,  l.e. ,  the  maximum 
time  permissible  between  aircraft  ooming  off  the  assembly  line.  Is  P/D  - 
14/12  -  1.167  hrs.  or  70  minutes.  Since  the  maximum  task  time,  task  P. 
is  50  minutes ,  we  can  meat  the  demand  with  a  properly  balanced  line. 

In  order  to  see  what  this  line  would  look  like,  we  need  to  allocate 
tasks  to  different  work  stations,  within  the  given  precedence  constraints 
In  order  to  better  visualize  these  precedence  constraints,  a  precedence 
diagram  is  dram  from  the  data  in  Table  4-1.  This  diagram  is  shorn  in 
Figure  4-1. 


Figure  4-1 
Precedence  Diagram 


To  make  the  first  line  balance,  we  will  use  a  rule  which  states: 
"First  allocate  those  tasks  that  have  the  longest  operation  time.” 

In  case  of  a  tie,  we  will  allocate  the  task  with  the  largest  number 
of  following  tasks.  The  results  of  employing  this  rule  are  shorn  in 
Table  4-2. 
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Table  4-2 


Balance  Made  According  to  Longest 

Operating  Time  (70  Min  Cycle  Tima) _ 

REMAIN IMG  FEASIBLE 

UNASSIGNED  REMAINING 

TASK _ TIME _ TIME  (MIN) _ TASKS 


D 

15 

55 

A,E 

E 

30 

25 

A«F 

F 

15 

10 

A 

A 

10 

0* 

None 

1 

20 

50 

b,k,l 

L 

25 

25 

B»K 

K 

10 

15 

B 

B 

5 

10* 

None 

O 

30 

40 

N,C 

N 

25 

15 

C 

C 

15 

0* 

None 

G 

45 

25 

H 

H 

15 

10* 

None 

J 

25 

45 

M 

M 

40 

5* 

None 

P 

50 

20 

Q 

Q 

15 

5* 

None 

♦idle 

♦♦total 

time  per  station 
idle  time  for  line 

30** 

From  the  table,  we  see  that  Indeed  six  work  stations  can  be 
structured  and  rebult  In  a  line  with  30  minutes  total  idle 
(Figure  4-2 )•  The.  efficiency  of  this  line  may  be  computed  as: 

\  390 

Efficiency  (%)  ■  —————  %  too  -  93% 

390  +  30 


also. 


390 

6(70) 


Efficiency 


93 


4-2  SCHEDULING  JOBS  THROUGH  OMB  PROCESS  OR  MACHINE 


The  supervisor  of  the  "wash  rack"  facility  (WRF)  at  Cormorant 
Air  Logistics  Center  has  been  receiving  oomplaints  from  the  Component 
Repair  Section  (CRS)  that  jobs  are  taking  too  long  on  the  average  to 
go  through  the  steam  cleaning  process.  This  delays  component  overhaul 
and  return  to  the  customer.  The  CRS  supervisor  claims  the  mean  flow 
time  through  the  steam  cleaning  process  has  been  over  18  hours  and  he 
must  have  a  mean  flow  time  of  less  than  12  hours  if  he  is  to  increase 
the  CRS 1 s  probability  of  meeting  its  schedule. 

The  WRF  supervisor  examined  his  records  and  found  that  the  same 
10  jobs  accounted  for  80  to  90  percent  of  his  CRS  work  load.  These  jobs 
and  their  typical  processing  time  for  steam  cleaning  are  shorn  in  Table  4-3 
below. 

Table  4-3 


Job 

Steam  Cleaning 
Time  (Hrs) 

ACQ  121 

2.25 

BRC  154 

1.75 

CRC  58 

0.75 

DNA  111 

0.50 

ELEC  6A 

3.75 

FQR  3 

1.25 

GIM  JT 

2.50 

HYD  1A 

6.00 

IND  7 

3.00 

JP  4A 

3.25 

The  normal  procedure  was  for  the  WRF  supervisor  to  get  all  the  job 
orders  at  the  beginning  of  the  week  for  that  week's  work.  He  would  then 
schedule  the  jobs  that  his  experience  told  him  ware  most  difficult  and 
time-consuming  first,  in  order  to  "get  them  out  of  the  way."  He  would 
then  schedule  the  remaining  easier  jobs.  Now  he  was  being  told  that 
the  jobs  were,  on  the  average,  taking  too  long.  He  wandered  how  he 
might  minimize  the  mean  flow  time  through  his  steam  cleaning  process. 


This  is  a  job  shop  scheduling  situation,  with  a  static  job  arrival 
pattern.  It  falls  into  that  class  of  problem  termed  an  ”n  job— one-machine 
problem."  In  this  type  of  problem,  the  shortest  processing  time  rule 
always  yields  the  optimum  solution. 


In  this  case,  the  WRF  supervisor  is  actually  scheduling  the  jobs 
with  the  longest  processing  time  first*  Thus,  he  would  schedule  the  jobs 
m  Table  4-3  according  to  the  sequence  shown  in  Figure  4-3*  The  mean  flow 
time  for  this  schedule  is  computed  by  summing  the  flow  times  for  all 
jobs;  i.e.,  6.00  +  9.75  +  13.00  +  16.00  +  18.50  +  20.75  +  22.50  +  23.75  + 

24.50  +  25.00,  and  dividing  by  the  number  of  jobs,  10,  i.e.,  179.75/10  « 
17.975.  Thus,  the  mean  flow  time  for  this  scheduling  sequence  is  almost 
18  hours,  well  over  the  12-hour  mean  requested  by  the  CRS  supervisor. 

Using  the  shortest  processing  time  rule,  the  jobs  would  be  sequenced  as 
shown  in  Figure  4- 4  and  the  minimun  mean  flow  time  would  be:  (.50  *  1.25  + 

2.50  +  4.25  +  6.50  +  9.00  +  12.00  +  15.25  +  19.00  +  25.00)»  10  -  9.525. 
Thus,  use  of  the  shortest  processing  time  rule  almost  halves  mean  flow 
time  and  allows  the  WRF  supervisor  to  meet  the  requirement  for  less  than 
12  hours  mean  flow  time. 


Figure  4-3 


H 

E 

J 

I 

G 

A 

B 

F 

C 

D 


Longest  Processing  Time  Schedule _ 

_ 6.00 

_ 9.75  PROCESSING  TIME 

_ 13.00 

_ 16.00 

_ 18.50 

WAITING  TIME  _ 20.75 

22.50 

23.75 

24.50 

25.00 

5 _ 10 _ 15 _ 20 _ 25 


TIME  (HRS) 


Figure  4-4 
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The  supervisor  of  the  Component  Repair  Squadron  (CRS)  at  Korona  AB 
Centralised  Intermediate  Repair  Facility  (CIRF)  was  reviewing  the  jobs  he 
had  to  schedule  for  the  upcoming  week*  He  was  concerned  because  operating 
bases  were  complaining  about  the  slow  turnaround  they  were  getting  on  their 
repair  requests*  The  CRS  supervisor  had  been  told  to  get  the  current  batch 
of  jobs  out  ASAP  and  not  later  than  within  48  hours.  The  CRS  supervisor 
wondered  whether  by  proper  scheduling  he  might  get  all  the  jobs  through  the 
shop  within  the  time  required,  using  only  the  men  and  equipment  available. 
The  jobs  and  the  processing  time  for  each  job  on  each  of  three  machines 
they  must  be  run  through  are  listed  in  Table  4-4,  below.  Each  job  must  go 
across  machine  I  first,  then  machine  II,  then  III,  in  that  order. 

Table  4-4 


MACHINE  (HRS) 

"i  I  II  I  III 


TOTAL 


Mm-6  Max-6  Min —6 

This  is  an  n/3  flow  shop  scheduling  situation,  with  a  static  job  arrival 
pattern.  In  this  type  of  problem,  an  optimal  solution  can  be  obtained  by 
an  extension  of  "Johnson's  Rule"  for  the  n/2  case. 


In  this  problem,  sum  job  processing  times  an  Machines  I  and  II,  then  sum 
job  processing  times  on  Machines  II  and  III.  This  will  give  two  columns  of 
times  as  shown  in  Table  4-5  below.  Apply  "Johnson's  Rule"  to  this  tableau, 
to  determine  job  sequence*. 

Table  4-5 


MACHINE  (HRS) 


I  6  II 


II  6  III 


SEQUEM 
JOHNSON ' 


Plotting  the  jobs  in  the  predetermined  sequence  across  each  machine  as 
shown  In  Figure  4-5,  below,  indicates  that  the  jobs  may  be  completed  in 
48  hours. 


Figure  4-5 

Sequence  of  Jobs  on  Machines 


MACHINE  JOB 


It  should  be  noted  that  Johnson's  Pule  will  give  an  optimum  solution  for 
the  n-job,  3 -machine  problem  only  where  the  maximum  operation  time  on  the 
second  machine  is  less  than  or  equal  to  the  minimum  operation  time  on 
either  of  the  other  machines,  i.e.. 


min  til  ^  max  ti2  or  min  ti3  ^  max  t^2 


Where  t^j  ■  processing  time  of  the  ith  job  on  the  jth  machine. 


LEARKIHG  CURVE  IN  ESTIMATING  MAINTENANCE  MANPOWER 


4-4  USE  OF  THE 

REQUIREMENTS 

A  contractor,  selected  to  produce  a  new  aircraft  for  the  USAF,  has 
indicated  to  the  government  that  the  aircraft  can  be  expected  to  require 
20  maintenance  man-hours  per  flying  hour  ( MH/FH )  approximately  18  months 
after  it  Is  introduced  into  the  operational  Air  Force*  During  Operational 
Test  and  Evaluation,  the  first  aircraft  off  the  line  experienced  approxi¬ 
mately  100  MH/FH1 s.  You've  been  asked  to  evaluate  the  contractor's 
MH/FH  projection,  subsequent  to  that  MH/FH  factor  being  used  to  estimate 
downstream  maintenance  manpower  requirements*  The  contractor  has  Indi¬ 
cated  that  the  aircraft  is  similar  to  the  F-222,  currently  In  the  USAF 
active  Inventory.  Data  for  the  first  18  months  of  operation  for  the 
F-222  is  shown  In  Table  4-6  below. 


Table  4-6 


Flying  Hours 

Per 

Month 

Man-Hours 

Per 

Month 

108 

9,601 

187 

19,298 

256 

17,894 

490 

28,322 

800 

42,480 

1106 

55,189 

1858 

91,042 

1927 

86,715 

1801 

79,064 

2700 

119,880 

3605 

144,560 

3659 

145,262 

4800 

168,480 

6400 

204,800 

7208 

223,448 

8000 

272,000 

7982 

278,572 

8640 

285,120 

There  are  a  number  of  ways  to  attack  this  problem.  Data  Is  scanty, 
and  numerous  assumptions  must  be  made.  Whatever  figure  is  finally 
derived  for  a  MH/FH  factor  will  have  to  be  considered  a  preliminary 
estimate,  subject  to  further  refinement  as  more  data  becomes  available. 


We'll  take  a  learning  curve  approach  to  this  problem.  Let's  examine 
the  contractor's  claims  in  terms  o£  what  the  learning  factor  has  to  be 
to  achieve  a  20  MH/FH  factor  in  the  18th  month  of  operation,  given  the 
1st  month '8  100  MH/FH  factor.  The  expression  for  the  Learning  Curve  is 

Yx  -  KXn 

Where : 

Yx *  man-hours/flying  hour  for  the  X*"*1  unit, 

K-"  "  "  "  "  1st  "  , 

X  =  the  Xth  unit 
n  *  log  b/log  2 
b  *  learning  factor 
For  our  case: 

20  =  100<18)1O9  b/1°*  2 
log  .20  -  log  b/log  2  (log  18) 
log  b  -  -.1676 
b  =  .68 

Thus,  the  contractor  is  expecting  a  68%  learning  factor— rather  optimistic 
relative  to  what  the  aircraft  industry  experiences,  which  is  closer  to 
80%. 

Looking  at  the  data  for  the  F-222  (Table  4-6) ,  one  can  see  that  as  the 
number  of  flying  hours  per  month  increases,  the  number  of  man-hours  per 
month  increases  as  well,  but  at  a  decreasing  rate.  We  can  analyze 
this  phenomenon  using  learning  curve  theory  by  reducing  the  man-hours 
per  month  to  man-hours  per  flying  hour  and  comparing  the  month  with  the 
MH/FH  figure  for  that  month.  We  can  use  regression  analysis  to  derive 
a  learning  factor  for  this  transformed  data.  The  month  represents  the 
independent  variable,  and  the  MH/FH 's  the  dependent  variable,  the 
expression  we'll  be  looking  for  will  be  the  log  transformation  of  the 
basic  learning  curve  equation ,  i • e . , 

Y  -  KXn 

log  Y  -  log  K  +  n  log  X 
which  takes  the  form 


The  data  pairs  useC  in  deriving  this  linear  bivariate  aodel  are  thus 
the  logs  of  the  X  and  Y  variables  shown  in  Table  4-7,  below. 

Table  4-7 


X  &  Y  Variables 


X 

Y 

1 

88.9 

2 

103.2 

3 

69.9 

4 

57.8 

5 

53.1 

6 

49.9 

7 

49.0 

8 

45.0 

9 

43.9 

10 

44.4 

11 

40.1 

12 

39.7 

13 

35.1 

14 

32.0 

15 

3-1. 0 

16 

34.0 

17 

34.9 

18 

33.0 

Using  a  standard  Linear  Regression  program,  such  as  found  in 
Statistical  Package  for  the  Social  Sciences  results  in  the  expression: 

Y  -  2.0293  -  . 4154X 

Remember  this  as:  log  Y  *  log  K  +  n  log  X 

We  translate  to:  log  K  *  2  0293 

and,  n  ■  log  b/log  2  *  -.4154 

then,  log  b  *  -.1250 

and,  b  *  .75 

If  the  contractor's  new  aircraft  is  similar  to  the  F- 222,  perhaps 
a  Learning  Factor  of  75%  would  be  a  more  reasonable  estimate  than  the 
68%  the  contractor  is  implying.  If  we  use  75%  for  the  new  aircraft,  we 
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find  for  the  18th  month: 


Y18  -  100 (18) 109  • 75/log  2 
*18  "  100(. 3013)  -  30.13 

Thus,  for  initial  planning,  a  MH/FH  factor  of  approximately  30 
for  the  18th  month  of  operation  appears  to  be  a  better  estimate  than  the 
20  MH/FH* s  the  contractor  has  predicted. 

What  impact  does  a  10  MH/FH  change  have  an  aircraft  maintenance 
manpower  estimates? 

Aircraft  maintenance  manpower  requirements  can  be  computed  based 
on  4  factors: 

(1)  The  MH/FH  factor  for  the  weapon  system  under  study,  e.g., 

the  F-222. 

(2)  The  Unit  Equipment  (UE) — number  of  aircraft  possessed — for 
the  squadron,  wing,  etc.,  under  consideration. 

(3)  the  programmed  flying  hours  (FH)  per  aircraft. 

(4)  The  productive  direct  manhours  available  per  month  per  man 

(AM). 

Mahpower  (M)  may  be  computed  as  follows: 


For  example,  suppose  the  F-222  has  a  MH/FH  of  30,  flies  25  hours/ 
month  and  is  assigned  to  6  wings  in  the  USAF  with  a  UE  of  72  aircraft 
per  wing.  Productive  direct  available  manhours  per  man  per  month  are 
86.4.  (This  is  a  standard  figure  based  on  60%  of  144  available  manhours 
per  month . )  Then , 

30  MH/FH  X  25  FH/MO  X  72  ACFT 

M  -  -  -  625 

86.4  MH/MO 

For  6  wings, 

-  6  X  625  -  3750 

Generally,  certain  factors  are  applied  to  this  figure  to  account 
for  functions  not  covered  by  the  MH/FH  factor,  such  as,  the  Chief  of 
Maintenance  Function,  Aerospace  Ground  Equipment,  Survival  Equipment, 
etc.  The  aggregate  of  these  factors  can  amount  to  117%  of  the  productive 


direct  manpower  figure,  or  in  our  example 


Mg  -  3750  X  1.17  -  4388 

At  20  MH/FH,  instead  of  30  MH/FH,  our  figure  would  be, 

20  X  25  X  72 

Mg  -  _  X  6  X  1.17  -  2925 

86.4 


a  difference  of, 

4388  -  2925  -  1463 

manpower  authorizations — over  240  per  wing!  This  could  make  a  big 
difference  in  wing  maintenance  capability.  Perhaps  more  importantly 
at  this  stage  of  manning,  it  makes  a  substantial  difference  in  budget 
planning.  At  about  $12,000  per  authorisation,  a  difference  of  463 
authorizations  equates  to  about  $17.6  million. 


4-5  BASE  LEVEL  EMERGENCY  INSPECTION  OF 


T-37  FLEET 


On  26  February  1979,  Air  Training  Command  (ATC)  sent  an  emergency 
message  to  all  Undergraduate  Pilot  Training  (UPT)  bases  (see  Exhibit  1 ) • 

The  message  restricted  all  T-37  aircraft  from  flight  in  excess  of  +3,0  Gs 
and  less  that  -1.5  Gs.  It  also  prohibited  aerobatic  maneuvers  exceeding 
these  G  tolerances  and  spins.  This  action  resulted  from  a  discovery  by  the 
San  Antonio  Air  Logistics  Center  (SAALC)  that  a  T-37  undergoing  normal 
programmed  depot  maintenance  had  two  cracked  wing  spar  bolts  and  one 
cracked  spar  attachment  fitting  (see  Exhibit  2).  The  seriousness  of  the 
situation  was  self-evident;  any  T-37  with  similar  structural  discrepancies 
risked  wing  separation  during  high  or  negative  G  maneuvering.  The  required 
corrective  action  was  a  one-time  inspection  of  all  T-37  aircraft,  to  be 
accomplished  at  base  level.  If  base  maintenance  found  the  wing  spar  bolts 
or  attachment  fittings  cracked,  they  were  to  be  replaced  at  the  base.  The 
SAALC  did  not  have  an  estimate  of  how  many  T-37s  actually  had  wing  spar 
bolts  or  attachment  fittings  cracked.  ATC  tasked  Reese  AFB  with 
constructing  a  plan  to  be  used  by  all  UPT  bases  for  efficient  accomplish¬ 
ment  of  the  one-time  inspection  and  corrective  action,  if  necessary.  The 
plan  was  to  be  submitted  to  ATC  by  10  March,  and  the  inspection  was  to  be 
started  on  15  March. 

The  activities  required  for  the  inspection  are  shown  in  Exhibit  3. 
Placing  the  wings  in  a  no  load  position  (activity  4)  requires  a  set  of  wing 
cradles.  Each  ATC  T-37  base  was  allocated  only  one  set  of  such  cradles. 

A  survey  of  the  bases  involved  in  the  inspection  was  accomplished  to 
determine  the  number  of  spare  bolts  and  fittings,  aircraft  possessed,  and 
personnel  manning.  This  information  is  shown  in  Exhibit  4.  How  should 
this  inspection  be  scheduled?  How  long  will  it  take  to  complete? 


This  problem  involves  the  use  of  activity  charting,  following  some 
simple  computations  to  determine  number  of  inspection  docks  to  employ.  The 
constraining  factor  in  the  inspection  would  appear  to  be  the  wing  cradles. 
Since  each  base  has  only  one  set  of  cradles,  it  is  necessary  to  minimize 
the  amount  of  time  they  are  on  any  one  aircraft.  Referring  to  Exhibit  3, 
the  sequence  of  activities  involving  the  cradles  appears  to  be  as  shown 
below. 


Min 

Place  wings  in  no 

load 

30 

Remove  bolts  and  fittings 

15 

Inspect  bolts  and 

fittings 

30 

Replace  bolts  and 

fittings 

15 

QC  Inspection 

30 

Remove  wings  from 

no  load 

30 

150  Hin< 


EXHIBIT  1:  THE  EMERGENCY  MESSAGE 


SUBJECT:  T-37  WING  SPAR  BOLT,  NUT,  AND  ATTACHMENT  FITTINGS. 


(1.)  T-37  TAIL  NUMBER  57-2344  WHILE  UNDERGOING  PDM  AT  SAALC  WAS 
FOUND  TO  HAVE  TWO  WING  SPAR  BOLTS  AND  ONE  WING  SPAR  ATTACHMENT  FITTING 
WHICH  WERE  CRACKED  AND  REQUIRED  REPLACING.  DUE  TO  THIS  OCCURRENCE,  A 
ONE-TIME  INSPECTION  OF  ALL  T-37  WING  SPAR  BOLTS,  NUTS,  AND  ATTACHMENT 
FITTINGS  WILL  BE  ACCOMPLISHED  NLT  180  DAYS  FROM  THE  RECEIPT  OF  THE  MSG. 
UNTIL  THIS  INS PEC TON  IS  ACCOMPLISHED,  ALL  T-37*  ARE  RESTRICTED  TO  +3.0 
Ga  and  -1.5  Gs,  AND  AEROBATICS  EXCEEDING  THESE  TOLERANCES  AND  SPINS 
ARE  PROHIBITED. 

(2.)  WING  SPAR  BOLTS  AND  ATTACHMENT  FITTINGS  WILL  BE  INSPECTED 
IAW  IT-378-6,  PARA  6-14.  CRACKED  BOLTS  WILL  BE  REPLACED  IAW  IT-37B-2-2 , 
PARA  3-16.  CRACKED  ATTACHMENT  FITTINGS  WILL  BE  REPLACED  IAW  IT-37B-2-6, 
PARA  3-12. 


(3.)  PRESENTLY  SAALC  HAS  136  WING  SPAR  BOLTS  (PN  500675-4)  AND 
48  ATTACHMENT  FITTINGS  (PN  500855-1)  IN  SUPPLY.  REQUIREMENTS  EXCEEDING 
BASE  STOCKS  SHOULD  BE  ORDERED  ASAP. 

(4.)  REESE  AFB,  TX  WILL  DESIGN  THE  INSPECTION  PLAN  TO  BE  USED  BY 
ALL  BASES.  THE  PLAN  MUST  BE  FURNISHED  TO  HQ  ATC/LG  NLT  10  MARCH  79. 

THE  PLAN  WILL  BE  DISSEMINATED  TO  ALL  BASES  ON  12  MARCH  AND  INSP  WILL 
BEGIN  ON  15  MARCH. 

(5.)  REESE  AFB,  TX  -  AFTER  INSPECTION  OF  10  AIRCRAFT,  YOU  WILL 
PROVIDE  ATC/LGS  WITH  A  FORECAST  OF  THE  EXPECTD  WING  SPAR  BOLTS  AND 
NUTS  REQUIREMENT  FOR  YOUR  BASE.  THIS  FORECAST  WILL  BE  USED  TO  REQUEST 
ADDITIONAL  SPAR  NUTS  AND  BOLTS  THROUGH  SUPPLY  CHANNELS. 

(6.)  THE  ATC  OPERATIONAL  READY  (OR)  RATE  OF  70%  MUST  BE 
MAINTAINED. 

(7.)  REESE  AFB  IS  AUTHORIZED  TO  START  INSPECTION  IMMEDIATELY  IN 
ORDER  TO  AID  IN  THE  DEVELOPMENT  OF  THE  MODEL  PLAN.  FINDINGS  AND  SOLUTIONS 
SHOULD  BE  INCLUDED  IN  THE  MODEL  PLAN. 
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EXHIBIT  2 


VISUAL  OH  REQUIRED  INSPECTION 
AREA  (WINGS  IDENTICAL) 


LOWER  ENGINE 
ACCESS  PANEL 


OPER  ENGINE 
ACCESS  PANEL 


EXHIBIT  3 


T.O.  IT-378-6,  PAHA  6-14,  Inspection  of  Wing  Spar  Bolts  and  Attachment 
Fittings • 


Activities 

Time 

Reouired  (Min) 

Personnel  Required 
Skill  Quantity 

1. 

Jack  Aircraft 

15 

APG 

2 

2. 

Be move  Engine  Panels 

15 

APG 

2 

3. 

Remove  Engines 

120 

ENG 

2 

HYD 

1 

4. 

Place  Wings  in  No-Load 

30 

MECH 

2 

5. 

Remove  Bolts  &  Fittings 

15 

MECH 

2 

6. 

Inspect  Bolts  &  Fittings 

30 

NDI 

1 

7. 

Replace  Bolts  &  Fittings 

15 

MECH 

2 

8. 

QC  Inspection 

30 

QC 

1 

9. 

Remove  wings  from  No-Load 

30 

MECH 

2 

10. 

Install  Engines 

120 

ENG 

2 

HYD 

1 

11. 

Replace  Panels 

15 

APG 

2 

12. 

Down  Jack  Aircraft 

15 

APG 

2 

Notes : 

1.  Inspection  will  take  place  in  hangar. 

2.  Towing  into  and  out  of  hangar  takes  15  min.,  and  requires  2  APG 
personnel,  and  2  other  personnel  of  any  skill  type. 

3.  Aircraft  will  require  engine  operational  check  and  PCF  prior  to  being 
returned  to  FTS.  This  takes  one  full  day,  on  the  average. 


Base 

Aircraft 

Spare 

Bolts 

Fittings 

Columbus 

48 

8 

3 

Laughlm 

36 

6 

2 

Reese 

72 

12 

4 

Vance 

48 

8 

4 

Williams 

72 

10 

3 

Sheppard 

48 

6 

2 

Personnel  Assigned 


APG 

Q 

HYD 

NDI 

QC 

54 

9 

5 

6 

3 

3 

43 

7 

3 

5 

2  j 

2 

82 

10 

7 

7 

3 

3 

51 

10 

3 

6 

3 

3 

88 

11 

6 

5 

3 

3 

56 

5 

5 

3 

3 

The  wing  cradles  are  tiad  up  for  ISO  minutes.  Therefore,  the  maximum 
number  of  aircraft  we  can  process  through  inspection  each  day  is: 

(24  Hr/Day)  (60  Min/Hr) 

_  -  9.6 

150  Min/Acft 

However,  30  minutes  of  the  time  the  cradles  are  employed  is,  in  effect, 
idle  time,  since  the  bolts  and  fittings  are  inspected  while  the  aircraft  is 
in  jig.  If  the  bolts  and  fittings  are  loaMdiately  replaced  with  good 
spares,  the  time  required  for  cradle  employment  can  be  reduced  20%  (from 
150  to  120)  and  the  maximum  number  of  aircraft  that  can  be  processed 
through  inspection  each  day  is: 

(24  Hr/Day)  (60  Min/Hr) 

_  -  12.0 

120  Min/Acft 

However,  the  total  time  required  for  the  inspection  (not  including  the 
engine  ops  check  or  aircraft  FCF)  is  450  minutes.  Therefore,  the  maximum 
number  of  aircraft  that  can  be  processed  through  inspection  in  any  24-hour 
period  is: 

24  x  60 

450  "3.2 

Therefore,  in  order  to  complete  the  inspection  on  12  aircraft  per  day,  we 
require  12/3  ■  4  inspection  docks.  In  order  to  determine  the  sequence  in 
which  the  aircraft  should  be  worked,  and  assess  the  personnel  impacts,  an 
activity  chart  can  be  constructed,  as  shown  in  Table  4-8. 

From  an  examination  of  the  activity  chart,  it  can  be  seen  that  the 
number  of  personnel  required  per  shift  and  the  total  personnel  requirements 
by  skill,  are  as  shown  in  Table  4-9,  below. 

Table  4-9 


Personnel  Requirements  by  81 


TOTAL  PERSONNEL 

SKILL 

PERSONNEL/SHIFT 

FOR  3  SHIFTS 

AP6 

2 

6 

ENG 

4 

12 

HYD 

2 

6 

2 

6 

gc 

1 

3 

NDX 

1 

3 

Table  4-8 


TOW 

JACK 

DEPANEL 


REPL  PANELS 
DOWN  JACK 
TOW 


TOW 

JACK 

DEPANEL 


REM  WINGS 
FM  NO  LOAD 


INSTALL 

ENGINES 


NO  LOAD  WINGS 


REM  BLTS/FIT 
REPL  BLTS/FIT 
QC  INSP 

REM  WINGS 
FM  NO  LOAD 


INSTALL 

ENGINES 


REPL  PANELS 
DOWN  JACK 


NO  LOAD  WINGS 

REM  BLTS/FIT 
REPL  BLTS/FIT 
QC  INSP 

REM  WINGS 
FM  NO  LOAD 


INSTALL 

ENGINES 


REPL  PANELS 
DOWN  JACK 


NO  LOAD  WINGS 

REM  BLTS/FIT 
REPL  BLTS/FIT 
QC  INSP 

REM  WINGS 
FM  NO  LOAD 


INSTALL 

ENGINES 


TOW 

JACK 

DEPANEL 

REMOVE 

ENGINES 


PI 


REPL  PANELS 
DOWN  JACK 


TOW 

JACK 

DEPANEL 


TOW 

JACK 

DEPANEL 

REMOVE 

ENGINES 


REPL  PANELS 
DOWN  JACK 


NO  LOAD  WINGS 

REM  BLTS/FIT 
REPL  BLTS/FIT 
QC  INSP 

REM  WINGS 
FM  NO  LOAD 


INSTALL 

ENGINES 


NO  LOAD  WINGS 

REM  BLTS/FIT 
REPL  BLTS/FIT 
INSP 


TOW 

JACK 

DEPANEL 


ENGINES 


REPL  PANELS 
DOWN  JACK 


NO  LOAD  WINGS 

REM  BLTS/FIT 
REPL  BLTS/FIT 
QC  INSP 

REM  WINGS 
FM  NO  LOAD 


INSTALL 

ENGINES 


242  1 
2422 
2422 
2422 
2422 
2422 
422 
42  1 
242 
2422 
2422  | 
2422 
2422 
2422 
422 
42 
242 
2422 
2422 
2422 
2422 
2422 
422 
42 
242 
2422 
2422 
2422 
2422 
2422 
422 
42 
242 
2422 
2422 
2422 
2422 
2422 
422 
42 
242 
2422 
2422 
2422 
2422 
2422 
422 
42 


T«bl*  4-8  Ccont.) 


735 

TOW 

nut  WINGS 

JACX 

765 

FM  NO  LOAD 

DEPANEL 

INSTALL 

NO  LOAD  WINGS 

REMOVE 

REPL  PANELS 

ENGINES 

DOWN  JACX 

REM  BLTS/FIT 

_ TOW _ 

REPL  BLTS/FIT 

99999999996S 

QC  INSP 

(99999999999 

TOW 

REM  WINGS 

JACK 

885 

FM  NO  LOAD 

DEPANEL 

900 

REPL  PANELS 

INSTALL 

NO  LOAD  WINGS 

REMOVE 

915 

DOWN  JACK 

ENGINES 

930 

REM  BLTS/FIT 

533939999% 

REPL  BLTS/FIT 

960 

9999999999 

QC  INSP 

975 

TOW 

990 

JACK 

REM  WINGS 

1005 

DEPANEL 

FM  NO  LOAD 

REPL  PANELS 

INSTALL 

NO  LOAD  WINGS 

ENGINES 

DOWN  JACK 

_ TOW _ 

REM  BLTS/FIT 

S99999999999 

REPL  BLTS/FIT 

0 

9999999999% 

QC  INSP 

TOW 

JACK 

REM  WINGS 

1125 

DEPANEL 

FM  NO  LOAD 

NO  LOAD  WINGS 

REPL  PANELS 

INSTALL 

1155 

ENGINES 

DOWN  JACK 

ENGINES 

1170 

REM  BLTS/FIT 

1185 

REPL  BLTS/FIT 

9999999999% 

QC  INSP 

99999999999 

1215 

TOW 

REM  WINGS 

JACK 

1245 

FM  NO  LOAD 

DEPANEL 

INSTALL 

NO  LOAD  WINGS 

REPL  PANELS 

ENGINES 

ENGINES 

DOWN  JACK 

REM  BLTS/FIT 

REPL  BLTS/FIT 

99999999999! 

QC  INSP 

99999999999! 

TOW 

REM  WINGS 

JACX 

1365 

DEPANEL 

1380 

REPL  PANELS 

INSTALL 

NO  LOAD  WINGS 

REMOVE 

1395 

DOWN  JACK 

ENGINES 

ENGINES 

1410 

REM  BLTS/FIT 

2099999990 

REPL  BLTS/FIT 

1440 

QC  INSP 

END  QF  DAT 
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Comparing  this  to  parsonnsl  assigned  by  base,  as  shown  in  Exhibit  4, 
gives  us  the  figures  in  Table  4-10.  Fran  an  examination  of  this  table,  we 
can  see  that  there  are  several  skill  shortages  among  the  bases.  Thus,  we 
cannot  operate  a  4-dock,  3-shift  per  day  operation  at  every  base. 

In  order  to  determine  what  our  options  are,  we  can  look  at  personnel 
requirements  for  different  operating  set-ups .  This  is  done  in  Table  4-11 
where  personnel  requirements  by  skill  are  shown  for  different  combinations 
of  docks  and  shifts  per  day.  As  can  be  seen  from  this  table,  there  is  no 
advantage  to  running  only  3  docks,  since  personnel  requirements  for  all 
skills  are  the  same.  Requirements  are  reduced  in  Engine  and  Hydraulic 
skills  when  only  two  docks  are  operated.  Reducing  the  number  of  shifts 
reduces  requirements  in  every  case. 

Next,  we  look  at  the  number  of  days  required  to  complete  the  inspection 
for  different  combinations  of  docks  and  shifts  per  day,  for  different  numbers 
of  aircraft  possessed,  i.e.,  36,  48,  or  72.  Examination  of  Table  4-12  indi¬ 
cates  that  the  minimum  amount  of  time  required  to  process  72  aircraft  through 
the  inspection  is  6  days.  If  we  decide  to  complete  the  inspection  for  all 
bases  in  6  days,  we  would  operate  4  docks,  3  shifts  per  day  at  Reese  and 
Williams,  which  possess  72  aircraft »  4  docks,  2  shifts  per  day  at  Columbus, 
Vance,  and  Sheppard,  which  possess  48  aircraft;  and  2  docks,  3  shifts  per 
day  at  Laughlin,  which  possesses  36  aircraft.  In  this  way,  the  inspection 
would  be  completed  in  6  days  at  all  ATC  bases. 

Reassessing  the  personnel  impacts  results  in  the  shortages  and  overages 
shown  in  Table  4-13.  In  order  to  coup  late  the  inspection  in  6  days  at  all 
bases,  some  shifts  in  personnel  assets  will  be  required.  For  example,  both. 
Reese  and  Williams  are  short  of  engine  specialists,  however,  all  other  bases 
have  excess  specialists.  Therefore,  the  option  of  sending  excess  specialists 
TOY  to  bases  with  shortages  should  be  explored. 
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N.  BASE 

sniiN. 

COL 

(48) 

LAD 

(36) 

REE 

(72) 

VAN 

(48) 

WXL 

(72) 

SHE 

(48) 

• 

RQD 

6 

6 

6 

6 

6 

6 

APG  ASGN 

54 

43 

82 

51 

88 

56 

+/- 

+48 

+37 

+76 

+45 

+82 

+50 

RQD 

12 

12 

12 

12 

12 

12 

9 

7 

10 

10 

11 

9 

+/- 

-  3 

-  5 

-  2 

-  2 

-  1 

-  3 

RQD 

6 

6 

6 

6 

6 

6 

HYD  ASGN 

5 

3 

7 

3 

6 

5 

+/- 

-  1 

-  3 

+  1 

-  3 

-  1 

RQD 

6 

6 

6 

6 

6 

6 

MECH  ASGN 

6 

5 

7 

6 

5 

5 

+/- 

-  1 

+  1 

-  1 

-  1 

RQD 

3 

3 

3 

3 

3 

3 

QC  ASGN 

3 

2 

3 

3 

3 

3 

V- 

-  1 

RQD 

3 

3 

3 

3 

3 

3 

NDI  ASGN 

3 

2 

3 

3 

3 

3 

+/- 

-  1 

Table  4-11 


PERSONNEL  REQUIRED  BT  SKILL.  BY  NUMBER  OF  DOCKS.  BY  NUMBER  OF  SHIFTS 


\MITTS 

3 

2 

1 

DOCKS 

■n 

E 

Cl 

n 

ES 

El 

El 

El 

a 

n 

El 

4 

6 

12 

Q 

B 

D 

D 

a 

a 

i 

i 

3 

6 

12 

a 

D 

3 

3 

B 

D 

a 

a 

2 

2 

2 

b 

2 

2 

i 

i 

2 

6 

6 

3 

6 

3 

B 

fl 

2 

a 

2 

2 

2 

r 

2 

1 

2 

i 

i 
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4.6  WORKLOAD  FORECASTING 

The  supervisor  of  the  Avionics  Antenna  Repair  Shop,  36  TFW,  at  Beilis  AFB 
is  attempting  to  forecast  his  work  load  requirement  for  the  coming  month,  July. 
Part  of  the  shop's  work  load  involves  repairing  antennas  for  the  F-15.  Since 
the  introduction  of  the  new  F-15  squadron  in  January,  the  number  of  F-15  antenna 
Line  Replaceable  Units  (LRUs)  received  for  repair  is  as  shown  in  Table  4-14, 
below.  Forecast  the  F-15  antenna  LRU  work  load  for  July. 

Table  4-14 


F-15  ANTENNA  LRU  REPAIR  REQUIREMENTS 


JAN 

FEB 

MAR 

APR 

MAY 

JUN 

23 

15 

26 

29 

24 

33 

A  graph  of  the  data  shown  in  Table  4-14  (see  Figure  4-6,  below)  illustrates 
what  appears  to  be  an  increasing  trend  in  the  number  of  antenna  LRU's  received 
for  repair. 


LRU' 


Number  of  Antenna  LRU's 
Received  for  Repair  by 
Month. 


Using  a  simple  moving  average  to  forecast  th  i  July  requirement  would  not 
reflect  this  trend.  For  example,  a  six-month  moving  average  is: 

23+  15  +  26  +  29+24  +  33 
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Where  a  system  appears  to  be  emerging  and  evolving,  a  better  approach  to 
forecasting  is  either  exponential  smoothing  using  relatively  large  values 
of  a  (the  smoothing  constant} ,  or  regression  analysis. 

In  this  problem,  an  exponentially  smoothed  forecast  for  July,  using  an  o  of 
.75  would  be: 

pt  ■  rt-i  ♦  -75  03  -  Ft-i> 

/ 

where  F^^  is  the  forecast  for  June. 

If  we  had  started  with  exponential  smoothing  in  January,  using  23  as  both  the 
forecast  and  actual  value,  the  exponentially  smoothed  forecasts  by  month  would 
be  as  shown  in  Table  4-15. 

Table  4-15 


EXPONENTIALLY  SMOOTHED  FORECASTS 


•actual 


For  July  then, 

Ft  -  24.9  +  .75  (33  -  24.91  »  30.975 

Plotting  the  forecasts  over  the  actual  values  in  Figure  4-6  results  in  the 
picture  shown  in  Figure  4-7.  It  can  be  seen  that  the  exponentially  smoothed 
forecast  lags  the  actual.  This  is  true  of  exponentially  smoothed  forecasts  in 
general. 

30 
25 

LRU's  20 
15 

5 

J 
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Using  regression  analysis,  we  can  fit  a  straight  line  to  the  data  using  the 
least  squares  method.  Computations  are  shown  in  Table  4-16. 

Table  4-16 


LEAST  SQUARES  COMPUTATION 


LRU'S 

MONTH 

2 

2 

Y 

X 

XY 

x2 

Y2 

23 

1 

23 

i 

529 

15 

2 

30 

4 

225 

26 

3 

78 

9 

676 

29 

4 

116 

16 

841 

24 

5 

120 

25 

576 

33 

6 

198 

36 

1089 

150 

21 

565 

91 

3936 

-  21  ,  _  —  150 

x  -  —  -  3.5  Y  -  —  "  25.0 

h* 

b  »  — — 

b2 

565  - 

-2  ‘  91  - 

-  nx 

6(3.51(25.0)  . 

6 (3.5) 2 

.286 

a  »  y  - 

bx  «  25.0  -  2 

.286 (3.5)  -  17.0 

Therefore 

Y  - 

a  +  bx  -  17.0 

+  2.286X 

Using  this  model  to  predict  the  LRU  work  load  requirement  for  July  yields: 

Y  -  a  +  bx  -  17.0  +  2.286(7)  -  33.002 

The  straight  line  plot  is  shown  superimposed  on  the  graph  of  actual  values  in 
Figure  4-8. 
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Figure  4-8 


•/ 


F  M  A  M  J  July 
MONTH 


Least  Squares  Line  and 
Actual  Values. 


lb*  Sun  Absolut*  Deviation  CMADL  for  each,  of  the  forecasting  methods,  expo- 
nan  tial  smoothing  CCS l  and  r agression  analysis  CPAl  can  ba  computed  as  shown 
in  Table  4-17.  Based  on  this  criterion,  the  forecast  for  July  should  be 
based  on  tbe  regression  model,  i.e.f  33.  The  disadvantage  of  using  this 
method  is  tbe  requirement  to  maintain  all  the  data  and  the  computational 
difficulty  (although  use  of  the  coaputer  or  an  advanced  calculator  eliminates 
the  latter  disadvantage!. 


CHAPTER  5 


TRANSPORTATION 


This  chapter  contains  several  informational  items  that  are  quite  often 
of  particular  concern  to  the  transportation  practitioner.  As  such,  they 
are  very  pragmatic  and  mundane.  The  subject  matter  ranges  from  calculation 
of  energy  consumption  by  mode  to  simply  finding  the  amount  of  liquid  in  a 
tank.  However,  these  are  the  kinds  of  simple  problems  that  usually  occur 
at  the  first  echelon  of  operations.  Many  of  the  items  were  taken  directly 
from  US  Army  Field  Manual  55-15,  Transportation  Reference  Data.  Others 
were  developed  for  this  chapter.  It  is  our  firm  hope  that  one  or  all  may 
be  of  use  to  you  as  you  function  as  a  transportation  practitioner. 

5. 1  LOGISTICS  IN  VIETNAM  -  HON  WE  DID  IT 


Mail  is  essential  to  morale,  especially  in  a  combat  zone.  Unfortunately, 
the  "official”  evaluation  of  its  importance  as  indicated  by  the  transpor¬ 
tation  two  priority  assigned  to  it,  is  low.  As  a  result  there  was  usually 
a  backlog  of  higher  priority  (i.e.,  a  priority  higher  than  two)  cargo  which 
should  have  taken  movement  precedence  over  mail,  no  matter  how  old.  However 
mail  is  a  very  sensitive  matter  to  the  soldier  in  the  field  and  to  his 
Congressman.  Often,  when  a  soldier  or  airman  failed  to  gat  a  letter  at 
least  every  few  days,  his  first  thought  was  that  the  distribution  system 
had  failed.  He  would  dash  off  a  letter  to  his  representative  about  the 
poor  support  being  provided  the  poor  soldier  in  the  field.  The  legislator, 
aware  of  the  clamor  that  can  quickly  grow  from  such  cases,  instituted  a 
Congressional  inquiry  into  the  matter.  Many  of  these  landed  on  my  desk. 
Usually  everyone  (soldier,  airman,  congressman,  and  military  commander)  was 
satisfied  if  it  could  be  shown  that  at  least  some  mail  went  to  every  unit, 
every  day.  However,  mail  was  sacked  and  palletized  by  destination  only. 

What  was  needed  was  some  method  that  could  "read”  units  out  of  destina¬ 
tions.  The  mechanism  that  was  able  to  accomplish  this  seemingly  impossible 
task  was  probability  theory. 

We  were  able  to  get  data  from  the  Postal  Service  which  included  the 
average  percentages  of  mail  shipped  to  each  unit  under  given  destination 
codes.  Assuming  a  random  process  in  the  selection  of  sacks  to  go  on  a  par¬ 
ticular  pallet  for  a  given  destination,  we  needed  only  to  randomly  select  a 
sufficient  number  of  sacks  from  pallets  destined  to  a  given  location. 


♦NOTE:  For  the  reader  who  wonders  what  ever  happened  to  the  nonselected 
mail  sacks,  please  be  consoled.  After  several  days,  mail  became  such  a 
"critical"  space  taker  at  our  terminal,  extra  airlift  or  surface  transport 
was  laid  on  to  "relieve"  the  port.  , 


These  few  bags  were  easily  "stuffed”  along  with  higher  priority  cargo, 
insuring  near-daily  deliveries  to  every  unit.  The  number  of  congressional 
inquiries  withered.  Soldiers  and  airmen  were  reassured  their  interests 
were  being  supported.  Commanders  slept  more  easily.  Statisticians  sighed, 
"At  last." 

5.2  APPROXIMATING  ENERGY  CONSUMPTION  BY  MODE 

Energy  is,  and  will  continue  to  be,  a  central  topic  in  the  world  of 
transportation  for  many  years  to  come.  Yet,  few  transportation  prac¬ 
titioners  have  directed  their  attentions  to  figuring  out  just  how  one  might 
compute  energy  consumption  by  mode.  One  hears  wild  claims  by  many.  A 
tractor-trailer  rig  at  65  miles  per  hour  (MPH)  gets  better  fuel  mileage 
than  the  same  rig  at  55  MPH.  A  river  barge  beats  all  other  modes  in  terms 
of  energy  consumption;  therefore,  we  should  encourage  more  canal  building. 
Pipelines  are  the  most  energy  efficient  mode— they  should  be  given  the 
right  of  eminent  domain.  Sound  familiar? 

The  following  procedure  will  provide  the  practitioner  a  simple, 
programmed  method  of  reducing  each  of  the  above  claims  to  numbers.  These 
numbers  will  not  always  be  100  percent  accurate,  but  they  are  reasonable 
approximations  of  energy  consumption  under  varying  conditions  for  the 
several  modes.  They  are  particularly  useful  in  evaluating  competing  modal 
claims  and  in  discerning  trends  within  a  given  mode.  The  accuracy  of  any 
of  the  estimations  can  be  increased  by  substituting  actual  values  whenever 
known  for  the  average  values  supplied.  In  both  the  rail  and  the  highway 
modes,  resistance  expressed  in  pounds  per  gross  train  or  motor  ton  is 
related  to  several  factors  as  shown  below: 

A  +  (B/W)  +  CV  +  DV2 

In  the  above  expression,  A  is  a  fixed  factor,  B/W  is  a  factor  that  varies 
by  weight,  CV  represents  the  factor  that  varies  with  velocity,  and  DV^  is  a 
factor  that  varies  with  velocity  squared.  Each  of  the  other  modes  has  a 
similar  set  of  factor  relationships  that  determines  its  resistance. 
Obviously,  it  is  very  cumbersome  to  compute  the  resistance  (R)  for  any  or 
all  of  the  modes  even  at  a  single  speed.  These  calculations  for  the  rele¬ 
vant  velocity  ranges  in  which  we  are  most  interested  are  graphically 
displayed  on  Figure  5-1. 

To  get  a  value  for  the  resistance  in  pounds  per  ton  for  any  mode, 
simply  read  across  the  bottom  scale  to  the  desired  velocity  in  miles  per 
hour  ( MPH ) ,  then  read  up  to  the  modal  line .  Read  the  value  R  from  the  ver¬ 
tical  scale  directly  across  (horizontally  to  the  bottom  scale)  from  the 
intersection  of  the  velocity  and  the  modal  line. 

EXAMPLE  5-1 :  Find  R  for  a  truck  which  weighs  5  tons  gross,  at  30  MPH. 

1.  Read  across  the  bottom  velocity  scale  to  30  MPH. 

2.  Read  up  the  30 -MPH  line  to  the  5 -ton  truck  line. 

3.  Read  the  R  value  across  from  the  velocity-mode  line  intersection. 

4.  The  desired  R  value  is  70  pounds  per  ton. 
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EXAMPLE  5-2:  Find  R  for  a  Great  Lakes  ore  carrier  of  27,000  tons  gross 
at  30  MPH. 


1.  Read  across  bottom  velocity  scale  to  30  MPH. 

2.  Read  up  the  30-MPH  line  to  the  27,000-ton  Great  Lakes  ore  carrier 
line. 

3.  Read  the  R  value  across  from  the  velocity-mode  line  intersection. 

4.  The  desired  R  value  is  19  pounds  per  ton. 

Though  the  R  values  read  from  Figure  5-1  are  a  direct  indication  of  just 
how  much  effort  or  energy  must  be  exerted  on  a  ton  to  keep  it  moving  at  any 
desired  velocity,  it  is  not  expressed  in  the  most  common  measure  of  energy 
intensity  for  the  various  modes.  The  most  useful  measure  of  energy  effi¬ 
ciency  is  the  British  Terminal  Unit  (BTU)  per  ton-mile  (TM) . 

All  fuels  have  a  known  BTU  content.  One  gallon  of  fuel  oil,  for 
example,  contains  approximately  130,000  BTUs.  A  pound  of  coal  contains 
approximately  14,500  BTUs.  Thus,  no  matter  what  fuel  is  consumed  by  a  par¬ 
ticular  mode,  the  BTUs  per  ton-mile  (TM)  can  be  calculated,  facilitating 
cross-modal  comparisons. 

To  turn  the  R  values  of  Figure  5-1  into  BTU/TM,  we  first  convert  R  into 
BTUs/hour,  then  divide  this  value  by  gross  ton  miles  per  hour. 


BTU/hour  *  R  times  gross  tons  times  velocity  times  6.8 
EXAMPLE  5-3;  Find  BTU/TM  (gross)  for  a  5-ton  truck  at  30  MPH. 
BTU/hour  >  70  x  5  x  30  x6.8 

BTU/hour  -  71,400 


If  a  5-ton  vehicle  is  traveling  at  30  MPH,  it  will  generate  150  ton- 
miles  (TM)  per  hour  (30  MPH  x  5  tons).  Energy  intensity  for  the  5-ton 
truck  is  476  BTU/ton-miles  (gross)  (71,400  BTU/hour  4-  150  TM/hour).  The 
same  calculations  that  we  performed  for  the  five-ton  truck  can  be  repeated 
for  any  mode,  at  any  velocity.  However,  the  resulting  BTU/TM  are  still  not 
directly  comparable  because  of  differing  thermal  efficiencies  and  percents 
of  the  gross  weight  dedicated  to  cargo  as  opposed  to  superstructure,  frames, 
bed,  etc. 
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The  BTU  of  the  calculations  so  far  are  pure,  absolute.  We  do  not  get 
130,000  BTU  from  a  gallon  of  fuel  oil  or  gasoline  that  is  put  into  our 
tanks.  In  fact,  the  thermal  efficiencies  of  our  modes  range  from  13  to  25 
percent.  (These  are  shown  in  Figure  5-2).  To  attain  the  71,400  BTU/hour 
for  the  5-ton  truck,  we  will  have  to  expend  more  BTU  in  raw  fuel  because  the 
thermal  efficiency  of  the  average  truck  is  only  25  percent.  This  requires 
an  adjustment  to  BTU/TM  calculation. 

Similarly,  the  TM  or  ton-mile  portion  of  the  calculation  requires  some 
adjustment.  A  low  BTU/TM  is  not  very  meaningful  if  most  of  the  tons  being 
hauled  are  devoted  to  the  vehicle  weight  itself  and  not  payload.  This  ’s 
the  case  with  the  airplane  and  the  bus.  Average  percent  full  factors  for 
each  mode  are  shown  in  Figure  5-2.  This  percent  represents  that  portion  of 
the  gross  weight  that  is  devoted  to  payload. 

Fortunately,  the  adjustments  for  both  the  BTU  and  TM  portions  of  the 
calculation  can  be  made  simultaneously  using  the  adjustment  factor  shown  in 
Figure  5-2.  (This  factor  is  the  product  of  the  percent  full  factor  and  the 
percent  efficiency  factor.)  To  correct  BTU/TM  (gross)  to  BTU/TM  (net), 
simply  divide  BTU/TM  (gross)  by  the  adjustment  factor. 

EXAMPLE  5-4 ;  Find  the  BTU/TM  (net)  of  a  5- ton  truck  at  30  MPH. 

1.  R  *  70  pounds  per  gross  ton  (from  Figure  5-1) 

2.  BTU/hour  =  70  x  5  x  30  X  6.8  -  71,400  BTU/hour 

3.  TM/hour  -  5  X  30  -  150  TM/hour 

4.  BTU/TM  (gross)  *  71,400  -*•  150  **  476  BTU/TM  (gross) 

5.  BTU/TM  (net)  -  476  .18  *  2644.4 


SOME  APPLICATIONS 


Let  us  reduce  the  slogans  in  the  introduction  of  this  section  to  hard 
numbers. 


EXAMPLE  5-5:  A  large  tractor-trailor  rig  (20  tons)  gets  better  fuel  mileage  at 
65  MPH  than  at  55  MPH. 


Find  BTU/TM  (net)  at  55: 

1.  R  -  58 

2.  BTU/TM  (gross) 

a.  BTU/hour  -  58  x  20  x  55  x  6.8  -  433840 

b.  TM/hour  *  20  x  55  -  1100 

BTU/TM  (gross)  -  433840  -4-  1100  -  394.4 

3.  BTU/TM  (net)  -  394.4  4-  .18  -  2191.1 


Find  BTU/TM  (net)  at  65 

1.  R  «  75 

2 .  BTU/TM  ( gross ) 


a.  BTU/hour  -  75  x  20  x  65  x  6.8  -  663000 

b.  TM/hour  *  20  x  65  »  1300 

BTU/TM  (gross)  -  663000  v  1300  -  510 

3.  BTU/TM  (net)  -  510  -r. 18  -  2833.3 


Conclusion:  The  required  expenditure  in  pure  energy  will  not  allow  better 
fuel  mileage  at  65  than  at  55. 


MOTE:  If  you  desire  to  estimate  the  gallons  of  fuel  per  hour  for  this  rig 
at  the  two  speeds  of  55  or  65,  proceed  as  follows:  First,  divide  the 
BTU/hour  by  the  %  efficiency  factor.  Second,  divide  the  resulting  value  of 
step  one  by  130,000. 
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EXAMPLE  5-6:  A  river  barge  beats  all  other  nodes  in  terms  of  energy 
consumption . 

Find  BTU/TM  (net)  for  a  1091-ton  barge  at  22  MPH. 

1.  R  *  100 

2.  BTU/TM  (gross) 

a.  BTU/hour  -  100  x  1091  x  22  x  6.8  »  16,321,360 

b.  TM/hour  -  1091  x  22  -  24002 

BTU/TM  (gross)  -  16321360/24002  -  680 

3.  BTU/TM  -  680  T  .16  *  4250  BTU/TM 

Conclusion:  4250  BTU/TM  (net)  >  BTU/TM  (net)  of  a  20-ton  tractor-trailer 
@  65  MPH.  Barge  is  not  most  energy  efficient  carrier  at  all  speeds. 

EXAMPLE  5-7:  A  pipeline  is  the  most  energy  efficient  mode. 

Find  BTU/TM  (net)  for  a  12"  pipeline  at  7  MPH.  [NOTE:  To  determine  ton 
miles  for  the  pipeline,  we  must  first  convert  the  flow  into  tons.  In  one 
hour,  seven  miles  of  fluid  will  pass  a  given  point.  That  is,  the  flow  past 
a  given  point  is  equivalent  to  a  column  of  fluid  36,960  (7  miles  x  5280 
feet/mile)  feet  tall  and  one  foot  in  diameter.  This  is  a  volume  of  29029.3 
cubic  feet  (36960  feet  x  .7854  square  feet  of  cross  sectional  area),  or  743 
tons,  if  the  fluid  is  oil.  (One  cubic  foot  of  oil  weighs  approximately 
51.2  pounds . ) ] 


1. 

R 

-  130 

2. 

BTU/TM  (gross) 

a. 

BTU/hour  ■ 

130 

x  743  x  7  x  6.8  -  4,597,684 

b. 

BTU/hour  ■ 

743 

x  7  -  5201 

BTU/TM  ( gross )  - 

4597684  f  5201  -  884 

3. 

BTU/TM  (net)  - 

884 

■r  .25  -  3536 
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Conclusion:  The  12-inch  pipeline  is  not  the  most  energy  efficient  node  at 
7  MPH  at  the  BTU/TM  (net)  >  the  BTU/TM  (net)  of  a  20-ton  tractor-trailer 
rig  at  65  MPH  as  calculated  above. 

By  reducing  each  of  the  slogans  cited  in  the  introduction  of  this 
energy  approximation  method  to  hard  numbers,  none  could  be  supported  at  the 
speed  values  selected.  Rhetoric  is  replaced  by  reason;  emotion  by 
evaluation.  Admittedly,  the  simplified  procedure  is  only  an  approximation. 
But,  part  of  its  worth  beyond  being  a  reasonable  approximation  lies  in  the 
ease  of  increasing  its  accuracy.  If  R,  the  fuel  efficiency  factor,  or  the 
percent  full  factor  is  known  for  a  particular  situation,  any  one  or  com¬ 
bination  of  the  actual  values  may  be  substituted  for  the  average  values  of 
figures  one  or  two  directly.  The  procedure  requires  no  modification  to 
accept  these  actual  values. 

EXAMPLE  5-8 ;  Find  the  BTU/TM  (net)  of  60-ton  railroad  car  at  55  MPH. 

1.  R  -  10 

2 .  BTU/TM  ( gross ) 

a.  BTU/hour  »  10  x  60  x  55  x  6.8  ■  224400 

b.  TM/hour  *  60  x  55  ■  3300 

BTU/TM  (gross)  -  224400  4-  3300  -  68 

3.  BTU/TM  (net)  »  68  4-  .19  »  358 

Answer:  358  BTU/TM  (net) 

EXAMPLE  5-9:  Find  BTU/TM  (net)  of  60-ton  heavy  transport  plane  at  260  MPH. 

1.  R  -  158 

2.  BTU/TM  ( gross ) 

a.  BTU/hour  “  158  x  60  x  260  x  6.8  ”  16,760,640 

b.  TM/hour  =*  60  x  260  •  15600 

BTU/TM  (gross)  -  16760640  4-  15600  -  1074 

3.  BTU/TM  (net)  -  1074  4-  .03  -  35813 

Answer:  35813  BTU/1M  (net) 

[NOTE:  The  values  for  the  transport  planes  of  Figure  5-1  do 
aircraft.  These  aircraft  average  about  10000  BTU/TM  (net), 
values  see:  Transportation  Vehicle  Energy  Intensities.  US 
Transportation/NASA.  Reference  Paper,  June  20,  1974.] 
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5.3  GRAPHICAL  SCHEDULING  TECHNIQUE 


Frequently,  the  transportation  practitioner  is  faced  with  the  problem 
of  dispatching  vehicles  or  convoys  of  uniform  makeup  onto  single  lane 
rights-of  way.  This  could  occur  on  single  width  roadways  under  austere 
conditions  or  on  single  track  rail  lines. 

As  one  vehicle  after  another  is  dispatched  outbound,  at  some  point  in 
time  they  will  begin  encountering  returning  vehicles.  Because  they  are  on 
single  vehicle  width  tracks  or  roadways,  provisions  for  passing  in  the  form 
of  turn-outs,  bypasses,  or  pull-around  must  be  provided. 

Tracking  the  movements  for  such  a  system  can  be  a  confusing  and  dif¬ 
ficult  problem.  This  is  especially  true  when  this  activity  occurs  under 
the  primitive  conditions  of  the  battlefield  where  sophisticated  data  pro¬ 
cessing  and  automatic  recording  equipment  are  not  always  available.  The 
following  Graphical  Scheduling  Technique  (GST)  provides  a  ready,  hand- 
calculable  approach  to  the  problem. 

On  the  graphical  schedule  (see  Figure  5-1)  each  right  descending  diagonal 
line  represents  a  departure  from  an  origin  station  at  mile  0.  At  mile  180, 
the  vehicle,  convoy,  or  train*  immediately  turns  around  and  proceeds 
towards  the  origin  station  at  mile  0.  The  time  that  any  vehicle  will  pass 
a  given  point  is  read  directly  from  the  graph. 

For  example,  the  vehicle  that  departed  mile  zero  at  12  midnight  passed 
mile  60  at  6:00  a.m.,  mile  120  at  12  noon  and  arrived  at  mile  180  at 
6:00  p.m.  (These  times  and  distances  are  marked  with  circles  on  Figure 
5-3).  Also,  the  location  of  a  vehicle  at  any  time,  or  conversely,  the  time 
at  which  a  vehicle  will  pass  a  given  location  mile  marker  is  easily  found. 

From  the  graph  it  can  be  seen  that  the  midnight  vehicle  makes  an  imme¬ 
diate  turnaround  at  mile  marker  180,  and  starts  back  toward  the  origin. 
Extending  the  graph  to  seventy-two  hours  the  schedule  for  the  midnight 
departing  vehicle  indicates  two  complete  round  trips  between  mile  zero  and 
mile  180,  with  the  second  arrival  at  the  origin  occurring  precisely  at  the 
same  time  of  day  as  the  original  departure  of  midnight.  (See  Figure  5-4). 

If  the  original  midnight  departing  vehicle  makes  a  complete  cycle  in  72 
hours  (3  days),  it  follows  that  the  three-day  cycle  applies  to  any  vehicle 
departing  mile  zero  at  any  hour  of  the  day,  if  it  proceeds  at  the  speed  of 
the  original  vehicle.  Thus  the  schedule  shown  in  Figure  5-2  actually  depicts 
an  endless  succession  of  three-day  cycles  of  vehicles  dispatched  from  mile 
zero  to  mile  180  and  returning  to  mile  zero. 

A  vehicle  dispatched  at  midnight  on  day  one  of  a  given  month  will  be 
ready  to  be  dispatched  again  from  mile  zero  at  midnight  on  day  4. 


•To  simplify  terminology  the  rest  of  the  discussion  will  use  the  single 
terra  vehicle  to  describe  a  vehicle,  convoy,  or  train. 
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Figure  5-5 


If  dispatched  immediately,  at  midnight  of  day  four,  it  will  be  back  at 
mile  zero  at  midnight  of  day  seven.  This  continues  indefinitely.  Thus  day 
one  for  any  month  represents  the  system's  status  (assuming  that  vehicles  are 
on  time)  on  days  4,  7,  10,  13,  16,  19,  22,  25  and  28.  Similarly,  day  2  of 
the  original  three-day  schedule  represents  days  5,  8,  11,  14,  17,  20,  23, 

26,  29.  Day  3  represents  days  6,  9,  12,  15,  18,  21,  24,  27,  30.  (The  days 
that  each  24-hour  period  represent  are  shown  above  each  of  the  three  24- 
hour  periods  of  Figure  5-5.  For  simplification,  the  first  24  hour  period 
is  labeled  an  A-day;  the  second,  a  B-day  and  the  third,  a  C-day) 

From  Figure  5-5  one  can  read  directly  for  the  vehicle  that  departed  mile 
zero  at  0936  of  day  one  (shown  as  a  heavy,  solid  black  line),  that  it  will 
pass  mile  marker  84  on  day  22  (or  any  of  the  other  days  of  the  month  shown 
with  day  1).  On  the  27th  of  the  month  at  1800,  it  will  be  at  mile  marker 
156,  traveling  towards  the  mile  marker  zero.  It  will  have  arrived  at  mile 
marker  156  at  the  same  time  as  the  vehicle  that  left  mile  marker  zero  48 
hours  behind  it,  on  day  1. 

GST  provides  a  method  of  determining  where  each  vehicle  will  be  at  any 
point  in  time  during  any  desired  period.  (The  period  is  not  limited  to  a 
month.  The  use  of  the  Julian  calendar  would  easily  extend  the  period  to  a 
full  year  where,  if  Day  A  represented  1  January,  or  001  on  the  Julian 
calendar.  Day  C  would  represent  Julian  day  366  in  a  leap  year.)  It  also 
dictates  where  each  bypass,  turnout,  or  pull  around  must  be  located.  It 
indicates  vehicle  spacing,  an  item  that  affects  safety.  For  example,  from 
Figure  5-5  at  6  a.m.  on  day  7  (Day  A),  train  3  is  at  mile  12,  train  2  at 
mile  36.  Trains  1,  15,  14,  13,  12,  and  11  are  at  mile  markers  60,  84,  108, 
132,  156,  and  180  respectively.  Spacings  are  a  constant  24  miles. 

In  many  situations  the  basic  GST  method  explained  above  can  be  very 
helpful.  However,  its  flexibility  to  accommodate  more  complex  situations 
is  deceiving. 

That  the  slope  of  the  diagonals  represents  average  speed  of  the  train 
is  not  immediately  apparent.  The  slope  of  the  diagonal  for  train  one  is 
180  miles  divided  by  18  hours  or  10  miles  per  hour.  (A  positive  slope 
indicates  outbound  movement).  Thus,  GST  can  accommodate  any  speed  for  any 
particular  distance  segment  simply  by  changing  the  slopes  of  the  diagonal 
for  that  segment. 

Figure  5-6  depicts  a  two-speed  situation.  Between  mile  marker  0  and  50, 
the  train  averages  10  mph.  Between  50  and  100,  it  averages  25  mph.  The 
overall  average,  represented  by  the  dashed  line,  is  14.29  mph. 

Assuming  an  hour  turnaround  at  the  end  of  each  roundtrip  at  mile  zero, 
four  trains  are  required  to  make  a  full  schedule  with  a  departure  every 
four  hours. 

The  completed  graphic  schedule  indicates  that  four  pull  bys  or  bypasses 
are  required.  One  each  at  miles  16,  44,  and  72;  one  at  the  turnaround 
point,  mile  100. 


figure  S-6 
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Figure  5-6  also  represents  a  common  situation  where  the  terrain  might  be 
predominantly  uphill  to  mile  50/  and  downhill  from  50  to  100.  The  return 
trip  is  similarly  uphill  for  the  first  half  and  downhill  over  the  second. 
This  accounts  for  the  different  speeds  over  the  two  halves  in  both 
directions.  In  summary,  GST  is  a  simple  scheduling  technique  that  may  be 
used  in  primitive  conditions.  It  indicates  where  all  vehicles  are  located 
at  any  time.  It  can  also  be  used  to  predict  where  any  vehicle  will  be  at 
some  future  time.  It  is  flexible  enough  to  accommodate  a  variety  of  speeds 
over  the  given  course.  Perhaps  most  important,  it  simplifies  the  locating 
of  those  spots  where  vehicles  must  pass. 

5.4  VOLUME  OF  LIQUID  IN  TANKS 

a.  General ♦  Transportation  planners  and  users  frequently  need  weight  and 
cube  data  for  liquids.  After  volume  of  liquid  has  been  determined,  the 
weight  may  be  found  by  multiplying  the  total  volume  in  gallons  by  the  weight 
of  1  gallon  of  the  liquid.  (Unit  weights  for  various  liquids  are  to  be 
found  in  Miscellaneous  Data.)  It  is  assumed  that  the  cylindrical  and  ellip¬ 
tical  tanks  discussed  below  have  flat  ends.  (If  a  tank  does  not  have  flat 
ends,  then  the  curvature  of  the  ends  must  be  determined,  and  the  volume 
contained  in  the  ends  added  to  the  volumes  shown  below.  However,  in  such 
cases,  it  is  usually  more  advantageous  to  calibrate  the  tank,  which  entails 
the  drawing  off  or  adding  of  measured  amounts  of  the  liquid  and  measuring 
the  change  in  liquid  depth  in  the  tank).  In  the  charts  below,  single  and/or 
double  interpolation  should  be  used  to  determine  values  between  those  shown. 
Values  shown  are  close  approximations  for  60°  F.  and  can  be  used  for  prac¬ 
tical  purposes.  Variations  in  conditions,  especially  temperature,  cause  the 

YV'  values  to  vary.  If  exact  quantities  are  desired,  the  formulas  shown  must  be 

used  in  conjunction  with  temperature  coefficients  of  expansion  and  contrac¬ 
tion  for  the  liquids  and  containers.  Presentation  of  detailed  computations 
for  varying  conditions  is  not  within  the  scope  of  this  manual,  but  approxi¬ 
mate  computations  for  POL  products  under  various  temperatures  are  shown  in 
f  below,  unless  otherwise  specified,  dimensions  are  in  inches. 

b.  Cylindrical  Tanks 
(1)  Vertical 

(a)  Formula.  The  volume  of  liquid  in  a  vertical  cylindrical  tank  may 
be  computed  by  using  the  formula  shown  below. 

Gallons  per  foot  of  liquid  depth 

(3.1416)  x  (radius  of  tank)2  x  (12) 

"  231 

(9.4248)  x  (diameter  of  tank)2 
"  231 

■  (.0408)  x  [diameter)2 
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Dtawtor 

OMtoa* 

OtaMtar 

Ota  mater 

Oallama 

<**.) 

far  ft 

ftp.) 

far  ft 

tilt.) 

per  ft 

334 

4460 

466 

8,460 

770 

24430 

sac 

4,680 

460 

8,630 

780 

24,870 

334 

4.610 

466 

8,880 

790 

25400 

337 

4,630 

470 

9,010 

300 

26,120 

333 

4,670 

476 

9460 

810 

26,800 

339 

4,690 

480 

9490 

820 

27,480 

340 

4,730 

486 

9,600 

830 

28,100 

341 

4,740 

400 

9,790 

840 

28.800 

342 

4.770 

496 

10,000 

860 

29.530 

343 

4,800 

600 

10400 

860 

30,200 

344 

4430 

605 

10,400 

870 

30.900 

348 

4460 

510 

10,600 

880 

31.680 

343 

4400 

515 

10410 

800 

32,350 

347 

4430 

630 

11,010 

000 

33,100 

348 

4,980 

535 

11440 

910 

33,800 

349 

4430 

630 

11,460 

920 

34.600 

360 

4,990 

635 

11,670 

930 

35,300 

361 

5430 

540 

11490 

940 

36.100 

363 

5460 

646 

12,100 

960 

3«.9«o 

363 

6,080 

660 

13,330 

900 

37,700 

354 

6,110 

666 

13460 

970 

38,450 

366 

6,180 

560 

12,790 

980 

39.250 

366 

6,180 

566 

13,090 

990 

40.100 

367 

6410 

670 

13480 

1.000 

40450 

388 

6440 

575 

13400 

1,010 

41.700 

369 

5470 

580 

13,750 

1,020 

42.600 

360 

6480 

686 

13480 

1,030 

43.400 

366 

5,440 

690 

14430 

1.040 

41450 

370 

6480 

600 

14,700 

1.050 

45.100 

376 

5,740 

610 

16410 

1.000 

45400 

330 

5490 

680 

16,700 

1,070 

46.750 

386 

6,060 

630 

16400 

1.080 

47,750 

890 

6400 

640 

16,750 

1,090 

48.60 0 

396 

6470 

660 

17460 

1.100 

49.500 

400 

6430 

660 

17,770 

1.110 

50400 

406 

6,700 

670 

18430 

1.120 

51450 

410 

6460 

680 

18480 

1,130 

52400 

418 

7.030 

690 

19,460 

1.140 

53450 

430 

7,190 

700 

20,000 

1.150 

54,100 

426 

7,330 

710 

20,600 

1.160 

65,100 

430 

7440 

730 

21,190 

1,170 

56.000 

436 

7.730 

730 

21,750 

1,180 

57,000 

440 

7400 

740 

32,360 

1,190 

58,000 

446 

8480 

760 

22480 

1400 

58,900 

480 

8460 

760 

33,600 

(2)  Horizontal. 

P«H  •/  tmmk  4*pth 

Part  af  tmak  mteW  IM 

(a)  When  the  capacity  of  a 

horizontal 

.80 

.860 

804 

cylindrical  tank  ia  known,  the  quantity  in  the 

.  f  •» 

.70 

.740 

tank  may  be  approximated  by  uaing  the  acale 

,r»5 

.687 

below. 

.00 

.626 

.nr, 

.563 

Part  0/  tea*  isptk  JUM 

r«r«  0t  taut 

.50 

.500 

1.000 

.45 

.437 

0.96 

.974 

.40 

474 

40 

.948 

.35 

.313 

.38 

.904 

..10 

.262 
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ran  a/  MM  dtpiA  /UUd  Fart  #/  MM  a***at<»  JUM 

.25  .195 

.20  .140 

.15  .096 

.10  .052 

.06  .026 

Example.  A  horizontal  tank  in  80  inch**  in  diameter. 
Depth  oi  liquid  in  tank  is  20  inches.  Pull  tank  capacity 
is  8,000  gallons.  Find  th*  number  of  gallons  actually 
in  tank. 

~  =  0.25  (part  of  tank  depth  Ailed) 

0.25  (part  of  tank  depth  Ailed)  =  0.196  (part  of  tank 

capacity  filled) 

0.196  x  8,000  =  1,568  gallons  in  tank 

(b)  When  the  capacity  of  a  horizontal 
cylindrical  tank  is  not  known,  the  volume  of 
liquid  may  be  computed  as  shown  below. 

L  s  length  of  tank 
l  s  depth  of  liquid  in  tank 
r  =  radius  of  tank 

h  st  distance  from  top  of  tank  to  surface 
of  liquid 


D  sc  diameter  of  tank 
a  =  cross-sectional  length  of  “wet  arc" 
formed  by  liquid,  measured  on  lower 
part  of  tank 

a'  ss  cross-sectional  length  of  “dry  arc” 
above  liquid,  measured  on  upper  part 
of  tank 

2.  When  tank  is  less  than  half  full: 
Volume  (gel.)  =  -  (r  -  1)  V2ri  —  f*J  X-gj 

2.  When  tank  is  half  full: 

Volume  (gsl.)  =  ^ 

3.  When  tank  is  more  than  half  full: 

Volume  (gsl.) 

=  ^3.1416r*  -  o'r  +  (r  -  A)  V2r*  -  h*  J  x  gj- 

(e)  When  it  is  not  practical  to  measure 
a  and  a'  in  the  formulas  above  (because  of 
buried  tanks,  etc.),  the  lengths  of  these  arcs 
can  be  computed  by  determining  l/D  and  h/D 
ratios,  and  using  the  table  below. 


I/O 

Truk  Mm  than  hall  fal 1 
to 

ml  Met*  O 

b»  M  attan  I/O 

a 

OaaaMtV  M 
multiple  0 
t>  M  a*  tan 
• 

*/o 

TaM  am  Ikon  half  fad 
Qaaatti*  M 
aaltMl  O 

k,  M  a*tam  h/D 

Qaaatit*  ta 
multiple  D 
4*  M  Mian 

.01 

0.200 

26 

1.070 

.01 

0200 

26 

1.070 

.02 

0.284 

27 

1.098 

.02 

0284 

27 

1.093 

.03 

0.348 

28 

i;us 

U37 

.03 

0.348 

28 

1.115 

.04 

0.403 

.29 

.04 

0.403 

29 

1.137 

.05 

0.451 

.30 

1.150 

.05 

0.451 

.30 

1.159 

.06 

0.495 

.31 

1.181 

.06 

0.495 

21 

1.181 

.07 

0.536 

.32 

1203 

.07 

0236 

.32 

1203 

.08 

0.574 

.33 

1224 

.08 

0.574 

.33 

1224 

.09 

0.009 

.34 

1245 

.09 

0.609 

.34 

1246 

.10 

0.644 

36 

1266 

.10 

0.644 

.35 

1266 

.11 

0.676 

.36 

1287 

.11 

0.676 

.36 

1287 

.12 

0.708 

.37 

1.308 

.12 

0.708 

.37 

1208 

.13 

0.738 

.38 

1228 

.13 

0.738 

.38 

1.328 

.14 

0.767 

.39 

1.349 

.14 

0.767 

.39 

1.349 

.15 

0.795 

.40 

1.369 

.16 

0.795 

.40 

1269 

.16 

0.823 

.41 

1.390 

.16 

0.823 

.41 

1290 

.17 

0.850 

.42 

1.410 

.17 

0250 

.42 

1.410 

.18 

0.876 

.43 

1.430 

.18 

0276 

.43 

1.430 

.19 

0.902 

.44 

1.451 

.19 

0202 

.44 

1.451 

JO 

0.927 

.46 

1.471 

20 

0227 

.46 

1.471 

21 

0.952 

.46 

1.491 

.21 

0.952 

.46 

1.491 

.22 

0.976 

.47 

1211 

22 

0.976 

.47 

1.511 

.23 

1.000 

.48 

1231 

23 

1.000 

.48 

1.531 

.24 

1.024 

.49 

1251 

24 

1.024 

.49 

1251 

.25 

1.047 

.50 

1271 

25 

1.047 

.50 

1271 

(d)  The  volume 

of  liquid 

in  flat-end 

length.  Therefore,  in 

the  formulas  shown  in 

horizontal  cylindrical  tanks  of  various  sizes  is 
given  below.  Quantities  are  in  U.S.  gallons  for 
each  liquid  depth  shown  per  foot  of  tank 


(b)  above,  the  ratio  L/231  becomes  a  constant 
of  0.0519.  To  obtain  liquid  volume,  multiply 
the  figure  from  the  chart  by  the  tank  length 
(feet  and  fractions). 
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J-71  10.4  18.0  28.9  40.0  52.1  65.1  78.7  93.0  107.9  123.2  138.9  164.9  171.2  187.7  204.4  221.1  238.0  254.8 

3.77  10.6  19.2  29  4  40.6  52.9  66.1  79.9  94.5  109.6  126.2  1414  157.6  174.3  191.2  208.2  225.4  242.9  259.9 

l°'l  199  298  41,1  55-7  67-1  81-2  M0  1114  127.3  143.7  160.4  177.4  194.7  212.1  229.7  247.8  265.0 

3.87  10.9  19.7  30.2  41.8  54.0  67.9  82.4  97.6  113.2  1294  145.8  162.9  180.3  198.8  216.8  233.8  252.3  270.0 

3.92  1 11.0  |  19.8  1  30,6  42.0  64.2  68.7  83.6  99.2  114.9  1314  148.0  165.5  183.2  203.0  219.5  238.0  256.8  274.9 


306.3  Ki  t  342.4  360.0  377.9  394.9  411.9  428.3  444.0  460.3  474.8  489.8  &03.0  816.0  327.0  837.8 

312.0  330.0  349.0  367.0  386.0  403.0  421.0  438.6  464.0  472.0  487.6  503.6  618.0  632.0  643.0  6S6.0 


Width  of  tank  (in.) 


Lurl 

231 


e.  Rectangular  Tanka.  The  chart  below  gives 
liquid  volumes  for  each  foot  of  liquid  depth  in 
rectangular  tanks  of  various  sizes.  To  obtain 
total  liquid  volume,  multiply  the  figure  shown 
by  liquid  depth  (feet  and  fractions).  Figures 
in  the  chart  were  obtained  from  the  following 
formula : 


Volume  (gal.)  = 

Where 

L  =  length  of  tank  in  inches 
w  s  width  of  tank  in  inches 
l  =  depth  of  liquid  in  inches 


Length  of  tank  (in.) 


It 

II 

It 

n 

u 

rr 

.10 

31 

31 

12 

7.48 

9.35 

1123 

13.10 

14.95 

16.85 

18.70 

20.58 

22.46 

IS 

9.36 

11.68 

14.02 

16.36 

18.72 

21.00 

23.33 

25.70 

28.02 

18 

11.23 

14.02 

16.83 

19.65 

22.44 

25.25 

28.05 

10.90 

13.66 

21 

13.10 

16.36 

19.65 

2220 

26.18 

29.50 

32.70 

16.00 

39.30 

24 

14.95 

18.72 

22.44 

26.18 

29.97 

33.70 

37.40 

41.20 

44.80 

27 

16.85 

21.00 

2525 

2920 

33.70 

37.90 

42.10 

46.30 

50.50 

30 

18.70 

23.33 

28.05 

32.70 

37.40 

42.10 

46.80 

51.50 

56.10 

33 

20.58 

25.70 

30.90 

36.00 

4120 

46.30 

51.50 

56.60 

61.80 

36 

22.46 

28.02 

33.66 

39.30 

44.80 

50.50 

56.10 

61.80 

67.30 

30 

24.30 

30.40 

36.50 

42.60 

48.70 

54.70 

60.80 

66.90 

72.90 

42 

26.19 

32.70 

3924 

45.80 

52.30 

58.90 

65.50 

72.00 

78.50 

46 

28.05 

35.10 

42.10 

49.10 

56.10 

63.10 

70.20 

77.20 

84.20 

48 

29.93 

37.40 

44.90 

52.30 

59.80 

67.30 

74.80 

82.30 

89.80 

51 

31.75 

39.71 

47.70 

55.70 

63.60 

71.50 

79.60 

87.50 

95.40 

54 

33.70 

42.10 

50.50 

58.80 

67.30 

75.70 

84.10 

92.60 

101.00 

57 

35.50 

44.40 

5325 

62.20 

71.00 

80.00 

88.80 

97.70 

106.60 

«0 

37.40 

46.76 

56.10 

65.40 

74.80 

84.10 

93.50 

102.90 

112.20 

63 

39.30 

49.10 

68.90 

68.70 

78.40 

88.40 

9820 

108.00 

117.70 

66 

41.10 

51.40 

61.70 

72.00 

82.30 

92.30 

102.80 

113.30 

123.40 

69 

43.10 

53.70 

64.50 

75.20 

86.90 

96.80 

107.40 

118.40 

129.00 

72 

44.90 

56.10 

6720 

78.50 

89.70 

100.90 

112.20 

123.50 

134.60 

75 

46.75 

58.40 

70.10 

81.80 

93.40 

105.10 

117.00 

128.60 

140.10 

78 

48.60 

60.75 

7220 

85.10 

97.20 

109.30 

121.60 

133.70 

145.90 

81 

50.50 

68.10 

76.70 

88.30 

100.80 

113.50 

126.30 

139.00 

151.40 

84 

52.40 

65.50 

78.60 

91.60 

104.60 

117.60 

131.00 

144.00 

157.10 

87 

54.20 

67.80 

81.30 

94.80 

108.30 

122.00 

138.60 

149.20 

162.60 

90 

66.10 

70.20 

84.20 

98.20 

11220 

126.10 

140.30 

154.50 

168.30 

93 

58.00 

72.45 

8620 

101.40 

115.70 

130.40 

145.00 

159.50 

174.00 

96 

59.80 

74.80 

89.80 

104.50 

119.60 

134.50 

149.60 

164.50 

179.50 

99 

61.70 

77.20 

92.60 

108.00 

123.50 

118.90 

154.40 

169.70 

185.20 

102 

63.60 

79.60 

95.50 

111.20 

127.20 

141.00 

158.90 

174.50 

190.70 

105 

65.40 

81.90 

98.30 

114.50 

131.00 

147.10 

161.60 

179.60 

196.40 

108 

67.30 

84.20 

10120 

1 17.80 

134.60 

151.50 

168.10 

184.90 

202.00 

111 

69.20 

86.60 

103.90 

121.00 

138.10 

155.50 

171.00 

190.00 

205.20 

114 

71.00 

88.90 

106.70 

124.40 

142.10 

159.80 

177.70 

195.0C 

211.20 

117 

72.80 

91.30 

109.50 

127.50 

145.90 

164.00 

182.40 

200.50 

219.00 

120 

74.75 

93.60 

112.40 

1.10.80 

149.60 

168.20 

187.00 

205.30 

224.10 
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Width  of  took  (in.)  Width  of  tank  (in.) 


Length  of  tank  (in.) 


69 

236.80 

247.40 

258.00 

-  ‘ 

72 

247.00 

258.00 

269.30 

*  . 

75 

257.00 

269.00 

280.40 

78 

267.40 

279.80 

291.80 

;  ' 

d 

81 

277.90 

290.40 

302.90 

-■  ■ 

84 

288.00 

301.50 

314.20 

M 

e 

87 

298.00 

312.00 

325.00 

1 

3  90 

at  102 

105 


519.00 

329.00 

339.30 

349.60 
360.00 

370.30 

380.60 
391.00 
401.70 
411.50 


333.00 

342.10 

355.00 

365.80 

376.80 
387.20 
398.00 
408.00 
419.00 
430.70 


347.50 
359.00 
370.70 

381.40 
392.90 
404.00 
415.00 

426.40 

437.50 
449.00 


[III 

El 

ES 


90 

280.50 
294.10 
308.60 
322.80 

336.50 
350.40 
364.70 


)  421.00 

435.00 

433.50 

449.00 

447.00 

462.50 

460.70 

477.00 

474.50 

491.00 

487.00 

505.00 

501.50 

519.00 

515.00 

533.00 

528.00 

547.50 

542.00 

561.00 

Lanjtth  of  tank  (in.) 


Width  of  Unk  (in.) 


Length  of  Unk  (in.) 


IM 

in 

in 

tn 

in 

U$ 

in 

12 

74.78 

76.70 

78.50 

80.40 

8225 

84.10 

86.00 

IS 

98.60 

96.90 

9820 

100.50 

103.00 

10520 

107.60 

18 

112.40 

11520 

118.00 

120.80 

12320 

126.50 

12920 

21 

130.80 

134.10 

187.40 

14020 

143.90 

14720 

15020 

24 

149.60 

153.50 

157.10 

160.90 

16420 

16820 

172.00 

27 

16820 

172.50 

176.70 

180.90 

185.00 

189.40 

19820 

30 

187.00 

101.80 

196.40 

201.30 

206.70 

21020 

215.10 

33 

206.30 

210.50 

215.50 

220.90 

226.00 

231.00 

23620 

36 

224.40 

230.00 

235.60 

241.00 

24620 

25220 

258.10 

30 

242.90 

249.00 

255.00 

261.00 

267.00 

273.30 

279.70 

42 

28120 

268.50 

274.90 

281.00 

288.00 

294.40 

301.10 

46 

280.70 

28720 

294.50 

302.00 

308.00 

31620 

322.80 

48 

299.10 

30420 

314.10 

32120 

329.00 

33620 

844.00 

51 

318.00 

325.00 

333.80 

34120 

349.80 

367.80 

36620 

54 

336.60 

345.00 

353.40 

362.00 

370.30 

378.80 

387.00 

57 

355.30 

364.30 

373.00 

382.00 

391.00 

400.00 

409.00 

no 

374.00 

383.80 

39220 

402.50 

411.40 

421.00 

430.30 

<53 

302.20 

402.00 

412.00 

422.00 

432.00 

441.80 

461.80 

60 

411.50 

421.50 

432.00 

442.00 

45220 

463.00 

473.00 

60 

430.70 

441.00 

452.00 

46220 

473.00 

484.00 

495.00 

72 

440.00 

460.00 

471.30 

482.00 

49320 

506.00 

616.00 

75 

467.00 

479.00 

491.00 

502.00 

614.00 

620.00 

537.50 

78 

486.00 

498.00 

510.50 

522.50 

535.00 

647.00 

669.00 

81 

504.00 

517.00 

530.00 

542.50 

556.00 

56720 

58020 

84 

523.00 

537.00 

54920 

562.50 

576.00 

589.00 

602.00 

87 

542.00 

556.00 

569.50 

58220 

697.00 

610.00 

623.00 

00 

561.00 

575.00 

589.00 

603.00 

617.00 

631.00 

645.00 

03 

579.00 

593.00 

607.50 

62220 

637.00 

662.00 

666.00 

06 

508.00 

614.00 

628.00 

644.00 

65820 

674.00 

688.00 

00 

617.00 

632.50 

648.00 

663.00 

679.00 

694.90 

711.00 

102 

6:16.00 

652.00 

667.50 

684.00 

699.00 

71520 

732.00 

105 

654.00 

671.60 

687.50 

703.00 

71920 

736.50 

752.50 

108 

673.00 

690.00 

70720 

723.50 

741.00 

757.50 

774.00 

111 

601.00 

709.00 

726.00 

743.00 

760.00 

77720 

795.00 

114 

711.00 

729.00 

747.00 

764.00 

782.00 

800.00 

818.00 

117 

729.00 

748.00 

766.00 

784.00 

802.50 

820.00 

839.00 

120 

747.50 

767.00 

785.00 

804.00 

823.00 

842.00 

866.00 

d.  Elliptical  Tanka  (Horizontal). 

(1)  Formula.  The  full  capacity  in  gallons 
of  horizontal  elliptical  tanks  with  flat  ends  can 
be  computed  by  the  following  formula. 

Volumo  (gal.)  - 

ZS1 

Where 

a  =  long  axis  of  elliptical  cross-section 


b  ss  short  axis  of  elliptical  cross-section 
L  =  length  of  tank 

(2)  Table.  For  tanks  of  known  capacity, 
the  partial  content  in  gallons  for  varying 
liquid  depths  can  be  determined  by  using  the 
scale  below.  (See  6(2)  above  for  example  of 
use.) 


Pvt  ef 

Pmrt  el 

Pert  if 

Pvt  •/ 

PmH  #/ 

U*Jr  mm*. 
tt*  /Um 

turn k  depth 

JUM 

tenh  mm*. 

U,  ttui 

tmnk  depth 
lUled 

tmnk  MyMr* 
it*  III** 

0.13 

0.0764 

0.43 

0.4114 

0.73 

0.7814 

0.14 

0.0880 

0.44 

0.4240 

0.74 

0.7927 

0.15 

0.0940 

0.46 

0.4366 

0.75 

03039 

0.15 

0.1032 

0.46 

0.4492 

0.76 

0.8150 

0.17 

0.1127 

0.47 

0.4619 

0.77 

02260 

0.18 

0.1224 

0.48 

0.4746 

0.78 

0.8368 

0.19 

0.1323 

0.49 

0.4873 

0.79 

0.8474 

0.20 

0.1423 

0.50 

0.8000 

0.80 

03577 

0.21 

0.1826 

0.81 

03127 

0.81 

0.8677 

0.22 

0.1682 

0.62 

03265 

0.82 

0.8776 

0.23 

0.1740 

0.63 

0.6381 

0.83 

0.S873 

0.24 

0.1880 

034 

0.6608 

0.84 

0.8968 

0.2S 

0.1961 

0.56 

0.5634 

0.88 

0.9060 

026 

0.2073 

0.66 

03760 

0.8« 

0.9160 

0.27 

0.2186 

0.57 

03886 

0.87 

03236 

028 

02300 

038 

0.6011 

0.88 

0.9320 

0.29 

02407 

0.59 

0.6136 

0.89 

0.9402 

0.30 

02831 

0.60 

0.6261 

0.90 

0.9480 

0.31 

02648 

0.61 

0.6386 

0.91 

0.9555 

0.32 

0.2766 

0.62 

0.6810 

0.92 

0.9626 

0.33 

02884 

0.63 

0.6634 

0.93 

0.9693 

0.34 

0.3003 

0.64 

0.6786 

0.94 

0.9755 

0.38 

0.3119 

0.66 

0.6881 

0.95 

0.9813 

0.36 

0.3244 

0.66 

0.6997 

0.96 

0.9866 

0.37 

0.3366 

0.67 

0.7116 

0.97 

0.9910 

0.38 

0.3490 

0.68 

0.7234 

0.98 

0.9950 

0.39 

0.3614 

0.69 

0.7352 

0.99 

0.9980 

0.40 

0.3739 

0.70 

0.7469 

1.00 

1.0000 

0.41 

0.3864 

0.71 

0.7593 

0.42 

0.3989 

0.72 

0.7700 

e.  Spherical  Tanka.  The  chart  ((4)  below)  the  sphere  segment  divided  by  the 

may  be  used  to  determine  the  liquid  volume  in  cube  of  the  sphere’s  diameter 

spherical  tanks  for  various  depths  of  the  li-  D  =  diameter  of  tank 

quid.  Also  shown  are  three  formulas  which  (1)  when  tank  ia  leaa  than  half  full: 

may  be  used  instead  of  the  table.  When  more  (n)  To  find  the  volutne  of  the  liquid. 

precise  computations  are  desired,  use  the  {orm  the  ratio  d/D  and  find  the  value  of  C  in 

formulas  which  do  not  require  figures  from  the  table.  Then  Volume  of  liquid  =  D*C 

the  chart.  Volumes  are  shown  in  cubic  inches;  (6)  Alternate  method:  (chart  not  re- 

cubic  inches  are  divided  by  231  to  obtain  gal-  quired)  Volume  of  liquid  =.5236d=(3D-2rf) 

Ions.  In  this  subparagraph,  the  words  “sphere”  ^ )  When  tank  ia  half  full- 

and  “tank”  have  the  same  meaning,  and  the  Volume  of  liquid  =.2618Z>«  (chart  not 

letters  used  represent  the  following:  required) 

d  s  depht  of  liquid  or  height  of  seg-  (3)  When  tank  ia  more  than  half  full: 

ment  of  sphere  formed  by  liquid  (a)  Since  the  ratio  d/D  (if  d  is  con- 

(when  tank  is  less  than  half  full)  sidered  the  depth  of  liquid)  becomes  greater 

=  distance  from  top  of  sphere  to  surface  than  0.50,  the  table  no  longer  applies.  There¬ 
of  liquid  (when  tank  is  more  than  fore,  the  chart  should  be  used  to  determine  the 

half  full)  volume  of  the  unfilled  portion.  Subtract  this 

C  =  a  vr.iue  for  different  d/D  relation-  from  the  total  volume  of  the  tank  (.5236P*)  to 

ships  which  represents  the  volume  of  obtain  the  volume  of  the  filled  portion.  In  this 
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case,  d  becomes  the  distance  from  the  top  of 
tank  to  the  surface  of  liquid.  To  find  the  vol¬ 
ume  of  the  unfilled  portion,  form  the  ratio  d/D 
and  find  the  value  of  C  in  the  chart.  The  un¬ 
filled  portion  is  then  D*C,  and  the  volume  of 
the  filled  portion  can  be  determined  as  follows : 

Volume  of  liquid  =  .523617*  -  D*C 
=  O'  (.5236  -  C) 


(b)  Alternate  method:  (chart  not  re¬ 
quired) 

[Volume  of  liquid  =  .52360*1 
-  .5236d*  (30 -2d)  J 

(4)  Table. 


d, ID 

C 

d/D 

C 

d/D 

C 

d/D 

C 

d/D 

C 

0.01 

0.0002 

0.11 

0.0176 

0.21 

0.0696 

0.31 

0.1198 

0.41 

0.1919 

0.02 

0.0006 

0.12 

0.0208 

0.22 

0.0649 

0.32 

0.1265 

0.42 

0.1995 

0.03 

0.0014 

0.13 

0.0242 

0.23 

0.0704 

0.33 

0.1334 

0.43 

0X072 

0.04 

0.0024 

0.14 

0.0279 

0.24 

0.0760 

0.34 

0.1404 

0.44 

0.2149 

0.05 

0.0038 

0.15 

0.0318 

0.25 

0.0818 

0.35 

0.1475 

0.45 

0X227 

0.00 

0.0054 

0.16 

0.0359 

0.26 

0.0878 

0.36 

0.1547 

0.46 

0X305 

0.07 

0.0073 

0.17 

0.0403 

0.27 

0.0939 

0.37 

0.1620 

0.47 

0X383 

0.08 

0.0095 

0.18 

0.0448 

0.28 

0.1002 

0.38 

0.1694 

0.48 

0X461 

0.00 

0.0 120 

0.19 

0.0495 

0.29 

0.1066 

0.39 

0.1768 

0.49 

0X589 

0.10 

0.0147 

0.20 

0.0545 

0.30 

0.1131 

0.40 

0.1843 

• 

0250 

0X818 

/• 

Temperature  Corrections  ( Approximate )  for  POL  Products. 

Coojloient  of 

CeefUteet  •/ 

I'ndnci 

ex  ps mips  pf 
foiilrtftivN 

(boot  of  10*  fJ* 

Hrndmnt 

1 

ee atrectiee 
rteee  ef  ST  **.)• 

Aviation  gasoline 

Motor  gasoline  and  nuphtha  (other  than 
cleaning  solvent) 

Light  crude  oil  (above  35*  API  gravity), 

0.00070 

0.00060 

Medium  crude  oil  (15*  to  35*  API 
gravity,  and  lubricating  oil 

Heavy  crude  oil  (up  to  15*  API  gravity), 
residual  oil,  and  asphalt 

0.00040 

0.00035 

jva  iiwti  vwaniu(  wnnii,  w 

distillate  fuel  oil,  and  fog  oil 

erusein. 

0.00060 

*fU(«ra  u»  ehani to  in  unit  volume  per 

(on  ever 

er  seder  W  V. 

Dimensions  of  Containers 

V>9UmfalMft 

UmU 

in 

Tvr 

Sim  nf  paeSa—  fin.) 
Width 
nr 

HnWkt 

rmmkntn 

pnnhntn 

Sieite 

Drum: 

U.S.  55-gal.,  16-gaga 

1 

Drum 

0 

24* 

34* 

U.S.  56-gal.,  18-gage 

1 

Drum 

0 

24* 

34* 

Can: 

U.S.  5-gal.  (gasoline) 

1 

Can 

13* 

6* 

18* 

U.S.  5-gal.  (oil) 

2 

Case 

0 

11 15/16 

14  3/16 

U.S.  5-qt.  (oil) 

6 

Case 

0 

14 

10 

U.S.  1-qt.  (oil) 

12 

Case 

18 

13 

6 

Pail: 

U.S.  25-lb  (grease) 

1 

Pail 

0 

11* 

11* 
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Bulk  Capadtto 


Carrier 

Csileue 

Ceeeiiu  »M 

Short  tone 

/mbs  sil 

B«rg«,  coastwise  * .  . 

200,000  to  400,000 

(hulk) 

45.9  to  91.8 

(kmtkt 

761  to  1*522 

Barg*,  harbor  and  canal  *  . .  . .  . 

16,000  to  30,000 

257 

57  to  114 

Barge,  Nary  ponton* . 

84,000 

120 

Pipeline: 4 

4-inch  . 

304,000  per  day 

930 

2,000 

1,150 

6-1nch . 

665,000  per  day* 

3,500 

2.500 

8-inch .  . 

1,135,000  per  day 

614  to  1,228 

4,350 

Railroad  tank  car .  . 

8,000;  10,000;  12,000 

24.1;  30.8;  36.8 

30.4;  38.1;  45.7 

Semitrailer,  12-ton,  4W  .  . 

6,000 

15.3 

19 

Ship,  large  tanker  *  .  . 

2.5  to  11  million 

7,620  to  33,500 

9.480  to  43,800 

Ship,  small  tanker* . 

600,000  to  2  million 

1,830  to  6,140 

2.280  to  7.610 

Tank,  bolted-steel  . 

10,500  ;  42,000; 

32.2;  128;  1,280 

39.9;  160;  1.600 

Tank,  portable  fabric* . 

420,000 

10,000 

30.fi 

38.1 

Tank  truck,  F-3,  fuel  or  oil . 

750 

2.3 

2.9 

Tank  truck,  L-2,  oil  service . 

600 

’..8 

2.3 

Trailer,  fuel-servicing . 

600 

1.8 

2.3 

Transporter,  liquid,  rolling-wheel  type 
(RLT).  1,000- gal  T3* . 

1,000 

3.0 

3.7 

Truck,  tractor  and  trailer,  F-l . 

4,000 

12.2 

15.2 

Truck,  tractor  and  2  trailer!,  F-1A . 

8.000 

24.4 

30.4 

Truck,  tractor  and  trailer,  F-2 . 

2,000 

6.1 

7.6 

Truck,  tractor  and  2  trailers,  F-2A  . . . 

4.000 

12.2 

15.2 

'  Mold*!  hails. 

1  Rcetnnaular  halls. 

*  «xlfl  ponton  harm  rarryinp  them  4Z.OOO."llon  tsnks  load'd  to  two. thirds  rapacity. 

*  In  mnlntainln*  tho  sons  vidumsm.  pipeline  capacity  far  pnooUno  sad  oil.  mors  prseeare  is  required  for  the  hanrior  liquid. 

*  Bas'd  on  12.500  cm  Ho  os  por  hour  for  20  hours  of  ops  ratios.  In  sn  mircrner  it  con  deliver  M.non  rations  per  hour  for  24  hours  of 
alteration,  or  720.000  rations  por  dap. 

*  The  ship  tanker  moot  commonly  mod  is  tils  T2-SE-A1,  a  4.022.0  OO-miinn  tanker.  It  I'  12'.  feet  ions  and  draws  II  rest.  It  hns  three 
4-IUneed  dlscharps  outlets  sad  four  disc  hares  pumps  rated  1.000  cpm  at  100  pel. 

*  Draft  Isodad.  II  to  20  foot. 

*  40  fast  Ions.  12  feat  wide.  1  foot  hlsh  when  Oiled.  When  empty,  it  ran  he  rolled  to  20  inches  hy  it  rest :  10  ran  he  carried  In  o  M  trork. 

*  A  pelr  ef  removable  synthetic-rubber  containers  (foot  eeilsl  mounted  on  on  sale  and  marine  unit.  Each  eoH  hna  a  rapacity  of  MO  sa Ilona. 


5-6  CgMTER-or -GRAVITY  DETERMINATION 

Xh«  military  transportation  officer  is  often  confronted  with  the 
problem  of  shipping  vehicles  or  other  wheeled  devices.  This  almost  always 
involves  the  computation  of  the  vehicle's  center-of- gravity.  The  following 
descriptions  provide  a  simple  step-by-step  approach  to  accomplish  this  very 
necessary  task. 


VMt  ccmcr-of-gnvky 
L  —  Wbcdbaae 
•i  ■  Fraei  ask  load 
w,  ■  Rear  ask  bod 
W  »  Total  wdsht  of  aait 


TO  IOCATI  CARGO  UNIT  CBfTBI  Of 
GRAVITY,  MTBMINf  X 


and  wj. 


Vduck  sm«  be  bed  whoa 


fill 


X  Ptttrdin  i  total  weight  of  oak  fW)  by 
aab  loadi  (*,  and  wj. 


Pgjggii 


•aab  load  (wj  by  whwlbaaa  (U. 


(X)  by  dfrtdbg 


RSMII 


r S rs 
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5-7  MISCELLANEOUS  DATA 

The  following  lists  contain  information  that  is  particularly  useful  in  the 
field  of  military  transportation. 


DIMENSIONS  OF  CONTAINERS 

Size  of 


Nomenclature 

Units 

in 

Package 

Type 

of 

Package 

Length 

package  ( in 
Hidth  or 
Diameter 

.) 

Height 

Drum: 

U.S.  55-gal.,  16-gage 

1 

Drum 

0 

24  1/8 

34 

3/4 

U.S.  55-gal.,  18  gage 

1 

Drum 

0 

24  5/8 

34 

7/8 

Can: 

U.S.  5-gal.,  (gasoline) 

1 

Can 

13  3/4 

6  1/2 

18 

3/8 

U.S.  5-gal.,  (oil) 

2 

Case 

0 

11  15/16 

14 

3/16 

U.S.  5-qt .  (oil) 

6 

Case 

0 

14 

10 

U.S.  1 -qt • ,  (oil) 

12 

Case 

18 

13 

6 

Pail: 

U.S'.  25- lb  (grease) 

1 

Pail 

0 

11  1/2 

11 

1/2 

Fluid  in  Pipeline 


HEIGHTS  OF  LIQUIDS  AND  GASES 


Pounds  per  cu  ft 


Liquids : 

Aviation  gasoline - - - - - - — - - — — —  44. 1 

91A  gasoline - — — — - - - - - — — — - 45.7 

Kerosene— ——————————————————  50.8 

Diesel  oil - 52.3 

Lubricating  oil - - - - — — - - — - - 56.8 

Hater,  fresh - - — - - - - - — - — - - 62.4 

Hater,  sea - - - - - — - 64.0 


Gases: 

Air -  0.07658 

Ammonia - 0.04509 

Benzene - - - - - —  0.20640 

Butane - 0.15350 

Carbon  dioxide— ——————————————  0.11637 

Chlorine -  0.18750 

Ethylene - - - - — - -  0.07410 

Helium— ——————————————————  0.01058 

Hydrogen— - - - — — - - - — - -  0.00530 

Methane - 0.04234 

Natural  gas - - — - - - - - - — - - 0.05140 

Oxygen- - -  0.08463 

Propane- - — - - - - - - - 0. 11645 

Steam - 0.04761 
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BASIC  STATISTICAL  CONCEPTS 


INTRODUCTION 


Basic  statistical  concepts  which  aura  of  use  to  the  military  logisti¬ 
cian  are  reviewed  in  this  chapter,  included  under  each  topic  area  aura  a 
general  description  of  the  technique,  algebraic  principles  and  formulas , 
and  at  least  one  example  of  a  specific  application. 

The  techniques  presented  in  this  chapter  should  not  be  considered  as 
the  only  input  to  the  decision-making  process.  Statistics  can  neither 
prove  nor  disprove  anything.  Their  primary  purpose  is  to  provide  a  clear, 
logical  approach  to:  (1)  the  conversion  of  raw  data  into  meaningful  infor¬ 
mation,  (2)  provide  support  or  credibility  to  a  particular  management  deci¬ 
sion,  or  (3)  narrow  the  list  of  alternative  solutions  to  a  problem. 

DEFINITIONS 

Some  of  the  basic  terms  used  in  the  discussion  of  probability  and  sta¬ 
tistics  are  defined  below. 

Experiment.  An  experiment  is  any  process  of  observation.  A  random  experi¬ 
ment  is  one  in  which  the  outcome  cannot  be  predicted  with  certainty,  i.e., 
the  outcome  is  a  chance  occurrence  as  in  tossing  a  die. 

Sample  Space.  The  set  of  all  possible  outcomes  of  a  random  experiment.  If 
the  experiment  is  the  toss  of  a  die,  the  sample  space,  S  *  {1,2, 3, 4, 5, 6}. 

Event.  An  event  is  a  subset  of  the  sample  space  and  may  include  one  or 
more  of  the  possible  outcomes.  If  an  event  contains  exactly  one  sample 
point  or  outcome,  it  is  called  a  simple  event.  If  it  contains  more  than 
one  sample  point,  it  is  called  a  compound  event.  In  a  toss  of  a  die,  a 
compound  event  Bj,  could  be  defined  as  an  odd  number.  If  a  1,  3,  or  5 
occurs,  then  event  Ej  will  have  occurred. 

Events  are  said  to  be  mutually  exclusive  if  they  cannot  occur  simul¬ 
taneous  ly ,  i.e.,  if  their  intersection  is  the  null  or  empty  set  {(if). 

Events  are  collectively  exhaustive  if  the  union  of  two  or  more  events 
comprise*  the  entire  sample  space. 

When  two  or  more  events  are  both  mutually  exclusive  and  collectively 
exhaustive,  they  form  a  partition  in  which  one  of  the  set  of  outcomes  of  an 
experiment  must  occur,  but  only  one  can  occur  in  any  single  trial. 


Probability .  Probability  (P)  is  a  real  number  which  describes  the  likeli¬ 
hood  of  occurrence  of  an  event  (E)  from  a  sample  space  (S)  such  that  the 
following  axioms  hold: 

1.  0  <_  P(E)  <_  1 

2.  P(S)  -  1 

3.  P^UEj)  -  P(Ei)  +  PCE^)  where  P(E.r|E^)  »  0 

A  Priori.  The  classical  definition  of  probability  is  based  upon  the  notion 
of  equally  likely  outcomes  of  an  experiment.  Probabilities  are  found 
through  prior  knowledge  by  enumerating  the  entire  finite  sample  space.  For 
example,  in  the  toss  of  fair  coin,  it  is  known  a  priori  that  the  probabi¬ 
lity  of  a  head  occurring  is  1/2. 

Relative  Frequency.  Probabilities  can  also  be  calculated  by  observing  out¬ 
comes  over  a  large  number  of  trials  when  the  probabilities  are  not  known  a 
priori.  Thus,  the  relative  frequency  probability  of  an  event  E  is  defined 
as: 


m 

P  (E)  -  ^ 
n 

where:  m  »  the  number  of  occurrences  of  event  E 
n  *  the  total  number  of  trials 

Subjective.  Probabilities  may  be  determined  judgmentally  or  based  upon  the 
personal  experience  of  the  manager  of  management  group.  For  example ,  in 
order  to  assess  the  impact  of  pending  teamster  contract  negotiations,  a 
transportation  officer  could  assign  probabilities  to  the  possible  outcomes, 
such  as: 


Event 

P 

Settlement 

.6 

Lengthy  arbitration 

.3 

Labor  strike 

.1 

1.0 

Note  that  the  foregoing  example  constitutes  a  probability  distribution  for 
the  possible  outcomes  in  that: 

1.  The  probabilities  sum  to  unity,  i.e.,  EP  ■  1. 

2.  The  events  are  mutually  exclusive,  i.e.,  no  two  events  can  occur 
at  the  same  time. 

3.  The  probability  of  each  event  is  between  zero  and  one. 

4.  The  events  are  collectively  exhaustive,  i.e.,  based  on  the  mana¬ 
ger's  experience;  no  other  outcomes  are  possible  in  this  case. 


Representation  of  Outcomes.  Probabilistic  outcomes  can  be  represented  in 
standard  set  notation,  tree  diagrams,  or  Venn  diagrams  as  follows: 

Set 


E.g.  Single  toss  of  coin: 
S  -  {H,T} 

E.g.  Toss  coin  twice: 

S  -  {HH,HT,TH,TT} 


E.g.  Roll  a  die: 

S  -  {1,2, 3, 4, 5,6} 


Counting  Principles.  Fundamentally,  probabilities  are  computed  by  finding 
the  ratio  of  favorable  outcomes  to  total  possible  outcomes, 

or  P(E)  -  - 
n 

which  is  the  relative  frequency  definition  provided  earlier.  To  evaluate 
and  n,  basic  counting  principles  are  used.  Simply  defined:  If  there  are 
different  ways  to  accomplish  event  A, and  Y  different  ways  to  to  accomplish 
event  B,  then  there  are: 

X  +  Y  ways  to  accomplish  A  or  B 
X  •  Y  ways  to  accomplish  A  and  B 

Permutation .  A  set  of  elements  arranged  in  some  order.  Consider,  for 
example,  the  first  four  digits  of  a  federal  stock  number.  The  number  1100 
is  different  than  1010  which  contains  the  same  digits.  Thus,  when  the 
order  in  which  the  values  appear  is  important,  the  rules  for  permutations 
should  be  followed.  In  general,  the  permutation  of  n  things  taken  r  at  a 
time  is  found  by: 

n! 

nr  ”  (n-r) ! 

where  n!  (n  factorial)  is  the  product  of  all  the  natural  numbers  from  1 
to  n.  Thus,  n!  -  n(n-l) !  and,  by  definition  0!  -  1.  In  the  example  of  the 
first  four  digits  of  a  stock  number,  assuming  that  digits  camot  be 
repeated  and  that  zero  can  occupy  any  position  including  the  first,  the 
number  of  possible  stock  numbers  can  be  found  by: 


Tree  Diagram 


Venn  Diagram 
H  T 

o  o 

H  T 

CD 

12  3 

o  o  o 

4  5  6 

O  O  o 


a  x 


10  4  (10-4) ! 


10 -9-8 -7  -  5040 


In  the  special  case  when  n  ■  r,  ■  n! 

Repetitions .  Another  special  case  occurs  when  permuted  objects  involve 
repetitions.  For  example,  suppose  a  code  requires  the  use  of  only  the 
letters  KBBRB .  The  number  of  5-letter  "words"  which  can  be  formed  using 
those  letters  is  found  by: 

p  «  - — - 

n  n,n„ . . . n  n _ In. 1 . . .n  1 

12  r  12  r 

which  is  the  permutation  of  n  objects  of  which  are  alike,  are  alike, 
...,  n  are  alike.  In  the  example,  the  letter  B  appears  3  times,  so  there 
is  no  way  to  distinguish  between  them.  The  number  of  different  words  is: 

5 1 

31111!  “  20 

4  ^ 


(no.  of  Bs)  (no.  of  Ks)  (no.  ce  Rs) 

Permutations  are  especially  useful  in  statistical  sampling. 

Sampling  with  Replacement.  How  many  ways  can  5  cards  be  selected  from  a 
deck  of  52  if  each  card  is  replaced  after  it  is  drawn?  The  total  number 
of  possible  outcomes  (sample  space)  is: 

nr  -  525  -  380,204,032 

Sampling  without  Replacement.  If  the  cards  are  not  replaced: 


521 

n  r  "  52  5  "  (52-5)  1 


311,875,200 


Combination :  A  set  of  n  objects  taken  r  at  a  time  in  which  order  is  not 
important  can  be  found  by: 


r  (n\  Si 
nr  *  yr  J  rl  (n-r) 

that  C  »  /  P  \  /  . 
nr  \n  r)  /rl 


EXAMPLE  6-1.  A  new  USAF  briefing  team  is  being  established  which  will 
consist  of  2  majors  and  2  captains.  If  there  are  5  majors  and  8  captains  to 
choose  from,  in  how  many  ways  can  the  team  be  selected?  Since  order  is  not 
important,  there  are  gC_  ways  to  choose  majors  and  gC2  ways  to  choose  captains 
or  _C.  •  0C„  ways  to  select  the  team.  * 


c 

I 

1 


H  wv 


i 


8=2  -  (l  )  ■  67JT  ■  28 

or  10.28  ■  280  ways  in  which  a  team  can  be  selected. 

Computing  Probabilities .  Events  are  said  to  be  independent  when  the  out¬ 
come  o£  one  trial  in  no  way  influences  the  outcome  of  another.  For 
example,  in  the  toss  of  a  fair  coin,  a  head  appearing  on  the  first  toss 
will  not  affect  the  probability  of  obtaining  a  head  on  the  second  toss. 

For  independent  events,  the  probability  of  the  joint  occurrence  of  two  or 
more  events  is  obtained  via  the  special  rule  of  multiplication .  If  A  and  B 
axe  independent  events,  then  the  probability  of  their  joint  occurrence 
(their  intersection  or  f|  )  is  givrn  by  the  product  of  their  marginal  proba¬ 
bilities.  P(AflB)  ■  P (A)  •  P (B)  .  P(Af)B)  is  read,  "the  probability  of 
both  A  and  B  occurring." 

EXAMPLE  6-2.  In  an  experiment  consisting  of  two  tosses  of  a  coin,  what  is  the 
probability  of  obtaining  two  heads?  A  priori  it  is  known  tha_  the  probabi¬ 
lity  of  obtaining  a  head  (or  tail)  in  a  toss  of  a  fair  coin  is  0.5.  To 
obtain  two  heads  in  two  tosses,  appropriately  assuming  that  each  toss  is 
independent,  Pd^fl^)  -  P(H1)  •  P(H2) 

*  0.5  •  0.5 

*  0.25 

Conditional  Probability  is  the  probability  that  some  event  occurs  given 
that  some  other  event  has  already  occurred,  or  P(a|b)  is  read,  "the  proba¬ 
bility  of  event  A  given  that  event  B  has  already  occurred."  The  con¬ 
ditional  probability  of  A  given  B  is  found  using  the  product  rule: 

P(a|b)  -  ,  for  P (B)  jt  0 

If  A  and  B  are  independent, 

P(a|b)  -  P (A) 

and  P(A()B)  -  P(a|b)  *  P(B) 

*  P(A)  •  P (B)  as  above 

An  important  aspect  of  conditional  probability  is  reduced  sample  space. 

This  occurs  when  samples  are  selected  without  replacement.  It  should  also 
be  noted  that  independent  events  are  not  mutually  exclusive. 

Other  rules  and  relationships  may  be  useful  in  determining  probabilities. 

Complements .  A  is  read  "not  A".  P(A)  ■  1  -  (P(A) 

De  Morgan's  laws:  P(Afl8)  -  P(AUB)  ■  l-P(AflB) 

P (AU  B)  -  l-P(Ani) 
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EXAMPLE  6-3.  Sixty-seven  officers  are  available  for  reassignment  as  logisti¬ 
cians.  Of  these,  47  have  had  supply  experience,  35  have  had  maintenance 
experience,  and  23  have  had  both  supply  and  maintenance  experience.  If  the 
66XX  career  counselor  selects  one  at  random,  what  is  the  probability  that 
the  officer  selected  has  had  neither  supply  nor  maintenance  experience? 

Let  S  -  event,  officer  had  supply  experience 

M  *  event,  officer  had  maintenance  experience 

It  is  known  from  above  that 

PIS)  -  11  PM)  -  ||  PlSflM)  -  || 

We  wish  to  find  P(sf)M)  Using  De  Morgan's  law: 


P(M)  -  || 


P(SUM) 

-  i  - 

p(snM) 

or 

P(SflM) 

-  i  - 

p(sUm) 

Since 

P(SUM) 

-  P(S) 

+  P(M) 

-  P(SnM) 

Then 

p(?nM) 

-  1  - 

[P(S)  + 

P (M)  -  P(SflM)] 

“  1  - 


47  +  35  -  23 


-  1  -  .88 
-  0.12 

It  can  also  be  concluded  that  (.12) (67)  or  8  of  the  67  officers  have  had 
neither  supply  nor  maintenance  experience. 

Bayes'  Theorem.  An  extension  of  conditional  probability  states  that  . . . 
"Given  B^,  B2  ...  B  mutually  exclusive,  collectively  exhaustive  events 
let  A  be  an  arbitrary  event  in  the  universe  such  that  P(A)  f  0,  then 


PlSjlA)  -- 


P(A  B  , )  -PtBj 


Z  P(B  ).P(A|B.) 
i-1  1  1 


where  i  »  1,  2,  . . . ,  n 

j  *  1,  2,  ...,  orn 

determines  the  posterior  probability,  P(B  |a).  Uncertainty  is  reduced 
through  the  use  of  additional  information3 from  past  samples,  experiments, 
etc.,  to  modify  prior  probabilities . 
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EXAMPLE  6-4 .  A  certain  component  is  produced  by  three  manufacturers .  The 
total  output  is  purchased  by  the  Defense  Logistics  Agency.  Of  the  total 
production,  45%,  20%  and  35%  are  produced  by  manufacturers  A,  B,  and  C, 
respectively.  From  historical  records,  it  is  known  that  10%,  5%,  and  1% 
of  the  production  is  defective  from  manufacturers  A,  B,  and  C,  respectively. 
If  the  user  finds  a  part  which  is  defective,  what  is  the  probability  that 
it  came  from  manufacturer  A? 

Let  D  =  event,  part  is  defective 

A  =  event,  part  was  made  by  A 

B  =  event,  part  was  made  by  B 

C  =  event,  part  was  made  by  C 

What  is  P(A|D)? 

Using  Bayes'  theorem 

p(AiDl  „  _ P(d1a)P(A) _ _ 

'  P(D|A)P(A)  +  P(D|B)P(B)  +  P(D|C)  P(C) 

_ (.10)  (.45) _ 

(.10)  (.45)  +  (.05)  (.2)  +  (.01)  (.35) 

=  0.77 

Using  the  information  that  "the  part  is  defective,"  the  prior  probability 
that  the  part  came  from  manufacturer  A  is  revised  from  0.45  to  0.77. 


y.  -  E(k)  =  £kP(k) 

K  k 

where  u  *  mean  or  average  of  all  the  values  of  k,  a  discrete  random 
variable 

E(k)  =  expected  value  of  k,  another  name  for  the  mean 

P(k)  =  probability  associated  with  each  value  of  k. 

Note  that  the  mean  is  a  weighted  average  in  which  the  values  of  k  are 
weighted  by  their  respective  probabilities. 

Variance . 


o2  =  var(k)  *  E  [ck-V^)2 

-  £(k-u  )2P(k) 
k  K 

The  variance  is  referred  to  as  the  second  moment  and  expresses  the 
expected  value  of  the  squared  deviations  of  k  from  its  mean,  also 
weighted  by  the  probabilities  of  k. 

A  number  of  discrete  probability  distributions  have  been  found  to  occur 
quite  frequently  in  nature  to  describe  the  outcomes  of  random  processes. 
Several  of  these  which  have  specific  application  in  the  many  functions  of 
logistics  are  described  below.  The  reader  should  be  cautioned,  however, 
that  the  theoretical  probability  distributions  given  herein  may  not  com¬ 
pletely  describe  a  logistics  process.  In  such  cases,  erroneous  conclusions 
could  be  drawn  from  their  use.  In  many  instances,  however,  use  of  the 
following  distributions  can  and  have  provided  valuable  inputs  to  the  com¬ 
plex  area  of  logistics  decision  making. 

Uniform  Distribution 


Definition.  A  discrete  probability  distribution  where  all  the  integers 
from  a  through  8  have  an  equal  probability  of  occurrence.  This  distribution 
stems  from  equally  likely  probability  in  which  all  values  of  a  random 
variable  are  equally  likely  to  occur. 

Probability  mass  function. 

P  (k)  -  - B^o+1 - '  k  *  o*  o+l,  a+2,  ’  *  * ,  8 


where  k  is  a  discrete  random  variable 


Parameters .  a 

8 

Mean  s  y^ 


the  smallest  value  of  k 
the  largest  value  of  k 


g+B 

2 


Variance,  cr £  =  V(k) 


S-g+1 


(B-g) (B-a+2) 
12 


Cumulative  distribution  function. 

F(k0)  *  P(k  <  kQ)  =  (k0-g+l) 

where  kQ  is  a  particular  specified  value  of  the  random  variable  k. 


Uses. 

1.  This  distribution  could  be  used  in  a  random  sampling  process 
as  a  means  to  determine  whether  or  not  a  given  sample  is  random. 

2.  In  the  absence  of  probabilistic  information  for  any  discrete 
process,  the  uniform  distribution  could  be  used  to  provide  probabilities 
of  the  outcomes  of  the  process. 
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EXAMPLE  7-1.  At  an  aerial  port  the  number  of  pallets  of  cargo  awaiting 
airlift  on  any  given  day  has  am  unknown  distribution  with  no  less  than  four 
nor  more  than  eleven  on  hand. 


a.  What  is  the  expected  number  of  pallets  on  any  given  day? 
11+4 


uk  -  E(k) 


7.5  pallets 


b.  What  is  the  probability  that  more  than  seven  pallets  will  be  await¬ 
ing  airlift? 

P(k  >7)  «  P(k  >  8)  -  1-P  (k£7) 


1-  (7-4+11 


11-4+1 


*  0.5 


(Incidently,  this  is  the  same  probability  of  having  seven  or  less  pallets 
awaiting  airlift.) 

This  information  might  be  useful  to  the  traffic  manager  for  scheduling 
airlift,  loading  equipment,  and  manpower  during  periods  of  normal  operations 
During  emergency  operations  such  as  combat  support,  deployment,  or  relief 
operations  as  a  result  of  a  national  disaster,  this  distribution  would  not 
be  applicable  as  it  assumes  an  even  flow  of  pallets  and  could  not  account 
for  peaks  and  valleys  in  the  work  load. 

Binomial  Distribution 


Definition.  A  discrete  probability  distribution  which  expresses  the 
probability  of  success  associated  with  the  outcomes  of  n  Bernoulli  trials, 
where : 


1.  A  Bernoulli  trial  is  an  experiment  with  only  two  outcomes, 
usually  termed  "success"  or  "failure." 

2.  Each  trial  is  independent. 

3.  T  ere  is  a  constant  probability  of  success  for  each  trial. 

4.  There  is  a  fixed  number  of  repeated  trials. 

Probability  mass  function. 

P(k)  "  (k  j  P  (1_p) 

where  k  =  the  number  of  successes,  and 

/n\,  - Qj - 

\  ky  kl (n-k) I 
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Parameters .  n  -  the  number  of  trials 

p  “  the  probability  of  success  for  each  trial 
Mean.  *  E(k)  «  np 

Variance .  c£  ■  V(k)  *  np(l-p) 

Cumulative  distribution  function.  The  probability  of  obtaining  k  or 
fewer  successes  in  n  trials  can  be  found  by: 

*o 

m  l  fl)  P1C<l'P)n‘k  *  for  k  -  Q,l,2,”*,  k 
0  '  '  ° 

Cumulative  probabilities  can  be  found  more  easily  in  widely  published  cumula¬ 
tive  binomial  probability  tables. 

Uses.  This  distribution  may  be  used  with  any  Bernoulli  process  (two 
outcomes)  when  sampling  from  an  infinite  (or  very  large)  population,  or  when 
sampling  with  replacement  from  a  finite  population. 

EXAMPLE  7-2.  A  contractor  has  produced  a  certain  part  in  large  quantities  for 
the  Defense  Logistics  Agency.  Historically  it  has  been  found  that  5  percent 
of  the  parts  are  defective.  In  the  latest  shipment  of  30  parts,  what  is  the 
probability  that  at  least  one  defective  will  be  found? 

The  probability  of  at  least  one  defective  is  given  by: 

P(k  <  1)  or 

1-P (k»0)  where  k  is  the  number  of  defectives 

-  1  “  £(30°)  (0.05) 3  (0.95)3Cj 

-  0.786 


Poisson  Distribution 


Definition.  This  distribution  is  the  limiting  case  of  the  binomial 
distribution  and  can  be  used  to  approximate  the  binomial  particularly  when 
n  )  100  and  p  (  0.10.  The  Poisson  is  also  the  distribution  of  the  number  of 
independent  occurrences  of  an  event  over  a  specified  interval  of  time, 
volume,  or  space. 


7-5 


Probability  mass  function. 

.  .k  -n 

POO  *  -n^| —  e  ,  which  is  the  limit  of  the  binomial  distribution 

as  n  +  •  and  p  -*■  0. 

where  k  *  the  number  of  occurrences  of  the  event  in  the  interval 
n  *  total  number  of  intervals  in  the  sample 
p  ■  probability  of  an  occurrence  in  the  interval 
More  appropriately,  the  mass  function  is  given  by: 


where  \  ■  np 

Mean.  y_  =  E(k)  =  X 
K 

Variance .  V(k)  =  <r£  ■  X 

If  the  Poisson  is  used  in  instances  other  than  for  approximating  the 
binomial,  the  following  conditions  must  be  met: 

1.  Events  can  occur  at  any  point  in  the  continuum  of  space  or  time. 

2.  There  is  a  small  probability  of  occurrence  of  a  single  event  in  a 
given  interval. 

3.  The  probability  of  occurrence  is  constant  for  each  interval. 

4.  The  number  of  occurrences  in  one  interval  is  independent  of  the 
number  in  any  other  interval. 

Uses.  The  Poisson  distribution  is  often  used  to  describe 

1.  the  number  of  arrivals  in  a  given  time  period  at  a  queue; 

2.  the  number  of  failures  of  a  component  during  a  given  time  period;  and 

3.  the  number  of  defects  in  a  given  length  of  material. 

EXAMPLE  7-3.  An  airbase  has  an  average  of  two  unscheduled  landings  per 
hour  for  any  given  24-hour  period.  If  the  runway  must  be  closed  for  six 
hours  for  repair,  what  is  the  probability  that  two  or  less  unscheduled  land¬ 
ings  must  be  diverted  to  another  facility? 
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If  an  average  of  2  unscheduled  landings  occur  per  hour  then 
X  -  2  •  6  -  12 

can  be  expected  to  occur  in  any  6-hour  period.  Letting  X  *  12 
P(k  <  2)  -  P (k  =  0)  +  P(k  -  1)  +  P (k  »  2) 

2 

or  “I  P(k) 
k=0 


12°  -12  121  e"12  .  122  e'12 

—  e  +  IT”  2l~ 


*  0.0005 

Conversely,  there  is  a  (1.0000  -  .0005)  X  100  or  99.95  percent  chance  that 
more  than  2  unscheduled  aircraft  will  arrive. 


Definition.  A  discrete  probability  distribution  which  is  somewhat  like 
the  binomial  except  that  the  items  in  a  sample  are  drawn  without  replacement. 
The  probability  of  success  is  not  constant  for  each  trial. 


Probability  mass  function. 


where  k  is  a  discrete  random  variable,  the  number  of  success 
sample. 


Cumulative  distribution  function. 


F(ko)  -  P (k  <  kQ)  -  l  P (k) 

k=>  o 


/N-NjN  (N-N^)  ! 

\  n-k  /  *  (n-k) ! (N-N^-n+k) j 
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Parameters . 


”  total  number  of  successes  in  the  population 

n  -  sample  size 
N  *  population  size 

nN 

Mean,  =  E(k)  -  ■— =■ 

Vjrijiice. 


Uses.  The  hypergeometric  distribution  is  used  when  only  two  outcomes 
are  possible  for  any  given  trial  and  when  samples  are  drawn  without  replace¬ 
ment.  When  samples  are  drawn  with  replacement,  the  binomial  distribution 
is  used  instead. 

EXAMPLE  7-4.  A  squadron  is  preparing  for  a  deployment  exercise.  A  WRSK  kit 
contains  10  of  a  particular  replacement  component.  In  past  deployments,  an 
average  of  4  of  the  components  has  been  found  to  be  defective.  Due  to  weight 
limitations,  only  5  of  the  components  will  accompany  the  next  deployment 
exercise.  If  2  of  the  components  will  be  used  during  the  deployment,  what 
is  the  probability  that  at  least  2  of  the  5  components  in  the  kit  will  be 
serviceable?  If  k  *  the  number  of  serviceable  components  in  the  "sample"  of 
5  taken  on  the  deployement,  the  probability  that  at  least  2  of  the  components 
are  serviceable  can  be  found  by: 

P(k>  2)  or  1  -  P(k  <  1) 

-  1  -  [P(k»o)  +  P(k=l)  ] 


0.97619 


Continuous  Probability  Distributions 


4 


1C 

1 

1 


1  * 


l! 


I 


A  continuous  random  variable  can  take  on  an  infinite  number  of  values 
over  a  sample  space.  For  example,  the  number  of  weights  between  180  and 
181  pounds  is  theoretically  infinite,  but  in  a  practical  sense  limited  by 
the  sensitivity  of  the  weighing  (or  measurement)  device. 

Since  a  continuous  random  variable  cam  take  on  am  infinite  number  of 
values,  the  probability  that  a  single  value  will  be  observed  is  essentially 
zero ,  or 

P(X  -  XQ)  =  0 

The  probability  distribution  of  a  continuous  random  variable  is  repre¬ 
sented  by  a  probability  density  function  (PDF)  called  f Cx) .  Graphically, 
the  PDF  for  a  continuous  ramdom  variable  x  is  illustrated  below: 


For  a  continuous  random  variable  x,  the  PDF  must  exhibit  the  following 
properties 


1.  f(x)  >  o 


f  f  (x)  dx  =  1 


for  all  x 
c  <  x  <  d 


Probability  is  measured  by  area  under  the  curve  described  by  f(x),  known  as 
the  cumulative  distribution  function  (CDF)  or  F(x),  where 

F (a)  =  P(x  <_  a)  =  fc  f  (x)dx 

which  is  the  probability  that  the  random  variable  is  less  than  or  equal  to 
some  value  a.  In  addition  to  the  properties  of  the  PDF  above,  the  following 
properties  hold  for  the  CDF. 


1.  P(x  <  c)  -  F (c)  »  0 

2.  P(x  >  d)  -  0 


3.  P(x  <  d)  -  FCd)  *  1 
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It  should  be  noted  that  the  probability  density  function  is  the  first 
derivative  of  the  cumulative  distribution  function. 


f<x) 


dr<x) 

dx 


Consider  the  following  CDF: 


0,  x  <  0 


P(x)  ■  (r,  0  <  x  <  1 


1,  x  >  1 


The  shaded  area  represents  the  probability  that  a  continuous  random  variable 
x  is  less  than  0.5,  and  can  be  found  by: 

f  0.5 

F  (0. 5)  =  P(x  <  0.5)  *  JQ  ftx)  dx 


L 


0.5 


2xdx 


Note  that  2x 


afcdl 

dx 


or  f(x) 


F  (0.  5)  -  F  (O) 

dF  (x) 
dx 


0.25 


The  mean  and  variance  are  useful  descriptors  of  any  probability  distribution 
These  can  be  found  using  the  methodology  described  above. 
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Mean,  u  ■  £{x)  -  /  xf(x)<3x 
Using  the  example  above: 


y 

x 


(x) (2x)dx 

2x2dx 


=  F(l)  -  F (0  ) 


=  0.667 


Variance. 

°x  =  /c  (x“ux)2 

=  f  x2f(x)dx  -  y  2 
Jc  X 


Using  the  example  above: 


X 


w 


I 


I 


I 


« 


I 


The  probability  distributions  of  continuous  random  variables  and  hence  their 
probabilities  can  be  found  in  a  similar  manner.  However,  there  are  a  number 
of  probability  distributions  which  occur  quite  often  in  the  area  of  logis¬ 
tics  as  well  as  others.  Probabilities  have  already  been  computed  and  can  be 
found  in  published  tables.  The  more  common  continuous  probability  distri¬ 
butions  useful  to  logisticians  are  contained  in  the  sections  which  follow. 

Normal  Distribution 


Definition.  A  continuous  probability  distribution  described  by  a 
smooth,  symmetric,  bell-shaped  curve  which  is  completely  specified  by  its 
mean  and  variance.  One  of  the  most  useful  probability  models,  it  is 
characterized  by  the  following: 

1.  It  is  symmetric  about  its  mean,  y. 

2.  It  has  an  infinite  range,  i.e.,  unbounded,  so  that  any  interval  of 
numbers  will  have  a  positive  probability. 

3.  The  density  curve  falls  off  quickly  as  values  deviate  from  the  mean. 
In  fact,  more  than  99  percent  of  the  area  under  the  normal  curve  is 
encompassed  by  y  ±  3o ,  which  makes  this  distribution  a  good  approximation 
of  other  distributions  such  as  the  binomial  or  Poisson. 

4.  The  distribution  is  completely  defined  by  its  mean  and  variance. 

A  change  in  the  mean  displaces  the  distribution  to  the  right  or  left;  a 
change  in  the  variance  altars  its  shape. 

5.  It  exhibits  the  property  of  additivity;  i.e.,  the  sum  of  independent 
normal  variables  is  also  a  normal  variable. 

6.  Probabilities  are  computed  from  the  standard  normal  distribution 
which  has  a  mean  of  zero  and  variance  of  one  and  can  be  defined  as: 


Z  * 


x-y 

o 


where 


Z  =  standard  normal  variable 
X  a  normally  distributed  variable 
y  *  mean  of  the  normally  distributed  variable 
a  *  standard  deviation  of  the  normal  variable 
Probability  density  function  (standard  normal). 

_Uq>2 

•  e  ^  for  -  «  <  Z  <  +  « 


/ 


4 
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mean  =  0 


Parameters .  y_  » 

z 

=  variance  *  1 

Cumulative  distribution  function.  Cumulative  probabilities  are  found 
for  any  normal  variable  by  first  converting  it  to  a  standard  normal  variable 
as  described  above  and  using  standard  normal  probability  tables  found  in 
any  basic  statistics  text. 

Uses. 

1.  The  distribution  of  measurement  errors  is  usually  normal. 

2.  Sample  statistics  such  as  the  mean  are  distributed  normally, 
which  is  ideal  for  making  statistical  inferences. 

3.  Corrective  maintenance  times  are  often  normally  distributed. 

4.  The  normal  is  a  good  approximation  to  binomial  and  Poisson 
variables  when  sample  sizes  are  large. 

5.  The  use  of  the  normal  distribution  often  yields  satisfactory 
results  even  when  variables  are  not  normally  distributed. 

EXAMPLE  7-5.  A  manufacturer  estimates  that  its  truck  tires  have  a  lifetime 
which  is  normally  distributed  with  a  mean  of  13,000  miles  and  standard 
deviation  of  2000  miles.  If  it  guarantees  to  an  Air  Force  buyer  that  the 
tires  will  last  10,000  miles,  what  percentage  of  tires  can  be  expected  to 
wear  out  during  the  guarantee  period? 

First  it  is  necessary  to  use  the  trans formation 


X-u  =  10,000  -  13,000 
o  2,000 


-1.5 


therefore 

P(X  <  10,000)  «  P(Z  <  -1.5)  »  0.0668 

As  a  result,  the  Air  Force  buyer  can  expect  that  6.68%  of  the  tires  will 
wear  out  prior  to  the  expiration  of  the  warranty  period. 


Lognormal  Distribution  / 

Definition.  The  continuous  random  variable  x  is  said  to  have  a 
lognormal  distribution  if  the  log  x  is  normally  distributed  and  x  >  o. 


Use3.  The  repair  times  of  many  equipment  items,  particularly  electronic 
equipment,  are  often  skewed  right  due  to  the  extensive  repair  times  of  a  few 
maintenance  actions.  In  these  cases,  repair  times  often  follow  lognormal 
distributions  in  which  the  logarithms  of  repair  times  (not  the  repair  times 
themselves)  are  normally  distributed.  Probabilities  are  calculated  using 
the  procedures  for  the  normal  distribution. 

EXAMPLE  7-6 .  In  a  problem  dealing  with  repair  times  of  a  component,  the 
logarithm  of  each  repair  time  is  used  to  determine  probabilities,  instead  of 
actual  repair  time.  Thus,  the  mean  and  variance  of  the  distribution  can  be 
found  by : 

N 

I  log  xi 

Mean:  y.  =  E(log  x)  =  : - 

-  log  x  N 


here  x. 
1 


.  .  .  . th 

=  time  to  repair  i 


item 


N  =  number  of  items  repaired 


The  mean  time  to  repair  (MTTR)  can  be  found  by  taking  the 

In  essence,  the  mean  of  the  lognormal  distribution  is  the 
the  actual  observations  of  repair  times. 


antilog  of  y, 

’  log  x 

geometric  mean  of 


Variance : 


a 


2 

log  x 


N  N  2 

I  dog  x  )2  -  (l  log  x  )  / 

i=*l _  !■!  N 

N 


Where  x.  and  N  are  the  same  as  above. 

i 


Exponential  Distribution 

Definition.  A  continuous  distribution  which  is  a  special  case  of  the 
gamma  distribution.  If  the  distribution  of  discrete  events  is  Poisson,  the 
time  (or  interval)  between  the  occurrences  of  such  events  will  be  exponen¬ 
tially  distributed  under  the  following  conditions. 

1.  The  medium  of  measurement  is  continuous  such  as  time,  length, 
or  volume. 

2.  A  subset  of  the  medium  can  be  defined  such  that  the  probafcii:* 
of  two  or  more  occurrences  in  the  subset  is  zero. 

3.  The  probability  of  occurrence  of  an  event  in  a.iy  subs,-' 
medium  is  independent  of  the  occurrence  in  ar.y  other  subset 

4.  The  probability  of  occurrence  of  the  event  :s  rr  ?• 
the  size  of  the  subset  being  considered. 
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Probability  density  function. 
f  (x)  •  Xe  **  for  x  >  o 


Cumulative  distribution  function. 
*  -Xx 


F(x) 


/:  * 


dx  for  x  >  o 


1  -  e 


-Xx 


Parameter.  X  «  the  average  number  of  occurrences  of  an  event  in  a 
specified  interval  of  time  or  space. 


Mean,  y  ■  ■  average  interval  between  occurrences  of  an  event  x. 

'  X  A 

Variance . 


a 

x 


V 


2 

x 


Uses.  The  exponential  is  sometimes  referred  to  as  the  interarrival 
time  distribution.  It  is  often  used  to  represent  the  time  to  failure  of 

electronic  components  and  other  equipment  items.  Here  would  represent 

the  mean  time  between  failures  (MTBF)  of  a  component.  In  this  sense,  the 

exponential  distribution  is  known  as  the  constant  hazard  rate  reliability 

function,  i.e.,  the  failure  (hazard)  rate,  X  ,  remains  constant. 


EXAMPLE  7-7.  From  historical  records  it  is  known  that  the  MTBF  of  a  certain 
component  is  80  hours.  If  the  component  is  to  be  operated  for  10  con¬ 
secutive  hours  during  the  next  mission,  what  is  the  probability  it  will  not 
fail? 

If  it  can  be  assumed  that  time  to  failure  of  this  component  is 
exponentially  distributed,  let  x  -  time  between  failures. 


/ 


Thus  y 


80  hours 


The  probability  that  the  component  will  operate  for  more  than  10  hours  is 
found  by: 

P  (x  >  10)  -  1  -  P(x  <  10)  -  1-F(10) 
or  1  -  £ 1-e”  (io)  (10)] 

-  e"  (  80') <10)  -  e  "8  -  0.882 


The  chances  are  very  good  the  equipment  will  not  fail  during  the  next 
mission.  This  probability  (0.882)  is  really  the  reliability  of  the  equipment 
for  a  10-hour  period. 


i 

i 


I 
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CHAPTER  8 


ESTIMATION  AND  HYPOTHESIS  TESTING 


"We  use  Reason  £or  improving  the  Sciences;  whereas  we  ought  to  use  the 
Sciences  for  improving  our  Reason." 

Antoine  Amaulde#  1682 

(The  Post- Royal  Logic) 

This  observation  by  Arnaulde  is  certainly  as  applicable  today  as  it  was 
in  the  fifteenth  century.  It  is  the  purpose  of  this  chapter  to  explain  and 
demonstrate  the  use  of  science  (statistics#  in  particular#  probability  and 
hypothesis  testing)  for  improving  our  reason  (ability  to  make  a  decision). 

INTRODUCTION 

Statistical  analysis  has  come  to  play  a  central  role  in  the  decision 
making  process  within  all  of  the  logistical  disciplines.  The  type  of  ana¬ 
lysis  introduced  here  involves  the  decision  to  take  one  action  or  another 
based  upon  the  acceptance  or  rejection  of  a  statement  called  an  hypothesis. 
The  hypothesis  can  be  a  statement  about  the  output  of  a  machine,  the  number 
of  defective  parts  in  a  supply  shipment,  the  fuel  flow  in  a  turbine  engine# 
or  any  other  measurable  event  of  interest  to  the  decision  maker.  The 
hypothesis  is  actually  a  statement  made  About  the  value  of  a  parameter.  To 
test  the  hypothesis#  i.e.,  determine  whether  or  not  to  accept  the  value  of 
the  parameter  as  being  true#  a  sample  is  taken.  When  the  sample  statistic 
differs  from  the  parameter  stated  in  the  hypothesis,  a  decision  must  be 
made  as  to  whether  the  difference  is  a  consequence  of  random  sampling  error 
or  because  of  a  real  difference  between  the  population  from  which  the 
sample  was  drawn  and  the  population  whose  parameter  is  stated  in  the 
hypothesis.  The  latter  difference  is  called  a  "significant  difference"  and 
would  cause  the  decision  maker  to  reject  the  hypothesis#  i.e.#  to  conclude 
that  the  statement  about  the  parameter  is  not  true.  For  this  reason#  tests 
of  hypotheses  are  often  referred  to  as  tests  of  significance.  However, 
prior  to  developing  the  methodology  for  conducting  tests  of  hypotheses, 
relevant  and  related  material  on  statistical  estimation  is  presented  to 
provide  the  foundation  necessary  for  conducting  such  tests. 

STATISTICAL  ESTIMATION 

Fundamental  to  statistical  estimation  is  the  concept  of  a  random 
sample.  Decisions  must  be  made  without  complete  information.  Every  gre- 
nade  in  storage  can't  be  exploded  in  order  to  determine  the  rate  of  defec¬ 
tives.  Every  drop  of  water  in  a  reservoir  cannot  be  tested  for  suspended 
particulates.  Obviously,  this  would  be  too  costly  and  wasteful.  However# 
the  desired  information  about  the  population  of  interest,  i.e.,  the  supply 


of  grenades  or  the  water  in  the  reservoir,  can  be  obtained  fay  inferences 
derived  through  the  examination  of  only  a  small  portion  of  the  population . 
In  other  words,  information  about  the  characteristics  of  any  population  can 
be  obtained  through  a  random  sample. 


Let  X  be  a  random  variable  which  represents  a  population  having  a  distri¬ 
bution  with  a  mean  y  and  variance  a2.  Also  let  X^  be  a  random  variable,  the 
i™  value  observed  in  a  random  sample  from  the  above  population.  If  the 
3 ample  is  random,  then  Xj,  X2,  — ,  \  are  n  independent  random  variables  each 
having  the  same  distribution  as  the  population.  By  definition,  a  simple 
random  sample  is  one  in  which  every  element  in  the  population  has  an  equal 
and  independent  chance  of  being  selected.  Further,  a  statistic  is  a  function 
of  one  or  more  random  variables  from  a  random  sample  that  does  not  depend 
upon  any  population  parameters.  The  random  sample,  through  the  statistic 
which  is  calculated  from  it,  becomes  the  vehicle  for  making  inferences  about 
the  population  of  interest.  The  sample  statistic  then,  is  an  estimator  of  a 
population  parameter.  Common  notation  is  introduced  below: 


Sample 

Statistic 


is  an 

estimator  of 


population 

Parameter 


Variance 


x  or  u 

s2  or  o2 


Standard  deviation 


s  or  o 


Proportion 


p  or  t 


Standard  error  of 
the  mean 


s_  or  o_ 
x  x 


Calculations 


Algebraic  expressions  used  to  compute  the  statistics  are  given  below 
along  with  the  formulas  which  can  be  used  to  calculate  the  values  of  popu¬ 
lation  parameters.  The  formulas  are  common  to  all  statistics  texts  and  are 
stated  without  proof.  It  should  be  noted  that  whenever  a  sample  consists  of 
all  possible  random  variables  in  the  population,  then  the  sample  is  called  a 
census  and  the  value  of  the  sample  statistics  which  are  computed  are  the 
actual  values  of  the  population  parameters. 


Sample 


*-**■ 


Population 


N 

l  \ 

i-1  1 


Varlanca 


S2-±=i 


|  (Xj-X)1 


I  (x.-y)2 

.2  .  1-1-  _ 


Standard  deviation 


,:£T 


Standard  error  of 
the  mean 


I  (X.-X) 
1-1  x 
n-1 


N 

l  (X.-y) : 
i-1  1 


Where  n  -  number  observations  in  sample 

N  -  total  number  of  observations  (or  values)  in  population 


Properties  of  Estimators 

In  a  previous  section  it  was  stated  without  proof  that  certain  sample 
statistics  are  estimators  of  population  parameters. *  The  reason  why  X,  s2,  a, 
and  p  are  used  as  estimators  is  because  they  possess  certain  desirable 
properties.  Again,  without  proof,  the  desirable  properties  of  a  good  esti- 

A 

mator  are  given  below.  Consider  6  as  an  estimator  of  0,  then 

1.  0  is  unbiased  if  E(0)  -  0 

A  A 

2.  0  is  consistent  if  E(0)-^  8  and  Var  (0)  — —  0  as  n  -•»  - 

A  A  A 

3.  If  0  is  another  estimator  for  0  ,  then  0  is  more  efficient  than  0 
if  Var  (0)  <  Var  (5) 

-  f 

4.  If  0  contains  all  relevant  information  relative  to  the  parameter  it 

A 

is  estimating,  then  0  is  a  sufficient  estimator  of  0  . 


Stated  without  proof,  X,  a2,  a,  and  p  are  unbiased,  consistent,  efficient, 
and  sufficient  estimators  of  y,  a2,  a,  and  ir,  respectively.  Of  course, 
there  are  other  estimators  of  these  population  parameters,  but  they  are  used 
only  under  certain  conditions  because  they  do  not  contain  the  desired  prop¬ 
erties. 


For' example ,  the  a ample  median  can  be  used  to  estimate  y,  the  population 
mean.  However,  the  sample  mean  X  is  a  more  efficient  estimator  because  it 
has  a  smaller  variance  than  the  sample  median.  In  short,  the  best  estimator 
can  be  defined  as  the  unbiased  estimator  having  the  smallest  variance. 

As  another  example ,  we  know  that  s 2  is  an  estimator  of  a2.  Recall  that 

l  (xi-x) 2 

s2  »  -  .  Why  divide  by  n-1? 

Another  estimator  of  o2  is  S2,  where 


S2 


l  (X.-  X)2 
1-1  1 
n 


It  can  be  shown,  however,  that  S  is  biased,  i.e.. 
So,  instead,  we  use  s2  as  defined  above. 


In  addition  to  the  four  properties  listed  above,  X  the  sample  mean  has 
other  desirable  attributes. 


Central  Limit  theorem 


One  of  the  most  fundamental  and  important  theorems  in  inferential  statis¬ 
tics  involves  the  distribution  of  the  sample  mean  X.  The  central  limit 
theorem  states  that: 


If  a  population  has  a  mean  y  and  a  finite  standard  deviation  o, 
then  the  distribution  of  the  means  of  all  possible  samples  of  size  n 
drawn  from  that  population  will  be  approximately  normal  with  a  mean  y 

and  standard  deviation  -  . 

/n 


It  is  important  to  note  that  the  distribution  of  sample  means  X  will 
approach  a  normal  distribution  even  when  the  population  itself  is  not  nor¬ 
mally  distributed.  The  distinction  should  be  made  between  the  distribution 
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of  the  sample  and  the  distribution  of  the  saaple  mean,  if  the  sample  is 
random,  then  the  distribution  of  the  sample  should  resemble  the  distribu¬ 
tion  of  the  population .  in  contrast,  the  distribution  of  the  sample  mean 
will  be  normal.  The  understanding  of  this  important  distinction  is  essen¬ 
tial  to  the  application  of  sampling  theory  to  estimation  and  hypothesis 
testing. 


Sampling  Distributions 

In  many  cases  it  is  convenient  (and  correct)  to  assume  that  a  population 
is  normally  distributed.  When  this  is  true,  making  inferences  about  popu¬ 
lation  parameters  is  a  simple,  straightforward  task.  To  aid  in  the  process 
of  statistical  estimation  several  distributions  have  been  developed.  Of 
concern  here  are  the  t,  y},  and  F  because  sample  statistics  have  been  shewn 
to  follow  these  distributions.  The  use  of  these  distributions  requires 
that: 

1.  the  populations  are  approximately  normally  distributed,  and  that 

2.  the  observed  valves  of  the  random  variable  in  the  sample  sure 
independent.  This  will  be  true  if  the  saaple  is  random. 

These  observations  are  often  referred  to  as  normally  independently 
distributed  (NID)  random  variables.  As  such,  they  possess  certain 
desirable  characteristics .  For  example,  the  sum  of  NID  random  variables  is 
itself  normally  distributed  and  the  sum  of  the  squares  of  NID  random  vari¬ 
ables  has  a  Chi-square  (x2)  distribution. 

The  t  distribution  is  useful  in  making  inferences  when  using  X  as  an 
estimator  of  y.  It  can  be  shown  that 


Like  the  normal  distribution,  the  t  distribution  is  continuous,  symmetric, 
and  unbounded.  Its  shape  depends  upon  its  degrees  of  freedom.  As  degrees 
of  freedom  get  larger,  the  t  approaches  a  normal  distribution.  Degrees  of 
freedom  for  the  t  distribution  are  equal  to  the  saaple  size  minus  one  (n-1) . 

The  x2  distribution  is  useful  in  making  inferences  about  o2  when  s2  is 
used  as  an  estimator.  The  x2  distribution  has  degrees  of  freedom  (v)  equal 
to  n-1  and  is  related  to  the  t  distribution  in  the  following  manner.  Recall 
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When  the  denominator  is  multiplied  by 


a 


1  then 


the  numerator. 


is  the  standard  normal  variable  Z—-N (0,1),  and  the 


denominator 
of  freedom. 


A 


n-1 


,  or  the  x2  distribution  divided  by  its  degrees 


Squaring  and  rearranging  this  expression  results  in 


(n-1) a2  _ 

a2 


A 


where  v  •>  n-1.  The  X2  is  a  continuous  distribution  which  has  a  lower 
bound  of  zero  and  no  upper  bound.  Ms  such,  it  is  skewed  right.  (See 
Figure  8-1) .  Its  shape  depends  upon  v. 


the  F  distribution  is  the  ratio  of  two  randan  variables,  each  having  a 
X2  distribution  divided  by  their  respective  degrees  of  freedom,  therefore. 


Like  the  x2  distribution,  the  F  distribution  is  continuous  has  a  lower 
bound  of  zero  and  no  upper  bound,  and  is  skewed  right,  it  is  related  to 
the  t  distribution  as 

t*  _  [n(o,i)12  ,  A  "  1  ,  pl 

v  _xj_  xj 

V 

thus,  t*  is  the  same  as  the  F  distribution  with  one  degree  of  freedom  in  the 
numerator  and  v  degrees  of  freedom  in  the  denominator. 

In  summary,  the  use  of  sailing  distributions  make  it  very  convenient  to 
make  inferences  about  the  population  parameters  y  and  a2.  In  later  sections 
these  distributions  will  be  used  to  test  hypotheses  and  construct  confidence 
intervals  for  these  parameters.  The  t  distribution  will  be  used  to  infer 
on  y;  the  x2  and  F  distributions  will  be  used  to  infer  on  a2. 

Point  Estimation 


TWo  kinds  of  estimation  are  useful.  The  first  is  called  a  point  estimate 
wherein  a  single  value  is  used  to  estimate  a  population  parameter.  Point 
estimators  have  already  been  discussed.  Thus,  if  it  were  desired  to  estimate 
the  average  useful  life  of  Air  Force  bus  tires,  a  sample  of  tires  would  be_ 
selected  from  the  population  of  all  Air  Force  bus  tires.  The  sample  mean  X 
of  miles  before  wearout  would  be  computed.  This  would  then  be  the  beat 
estimate  of  the  average  miles  before  wearout  of  all  Air  Force  bus  tires.  The 
estimate  could  be  used  for  planning  new  tire  purchases. 

Note  that  X  is  the  mean  of  a  single  sample  of  tires  in  the  illustration. 

If  another  sample  were  taken,  the  new  X  would  most  likely  differ  from  the 
first.  The  question  to  be  asked  would  be,  which  one  is  the  correct  X  to  use? 
By  definition,  both  are  "best  estimators”  of  y.  The  answer  lies  in  the  use 
of  an  interval  estimator. 

Interval  Estimation 

The  objective  of  interval  estimation  is  to  provide  a  range  of  values  in 
which  the  population  parameter  is  estimated  to  lie.  TWo  values  are  specified, 
the  upper  and  lower  bounds .  Along  with  the  bounds ,  a  degree  or  level  of 
confidence  is  specified  which  can  be  defined  as  the  probability  with  which 
the  population  parameter  will  be  "captured”  by  the  two  bounds .  The  resulting 
interval  is  called  a  confidence  interval. 

The  level  of  confidence  is  expressed  as 

100  (1  -  o)  *  >- 


where  o  is  some  small  value  less  than  1,  such  as  0.05  or  0.01,  which  represents 
the  probability  that  the  interval  will  not  capture  the  population  parameter. 


In  developing  estimates  for  the  population  mean  y,  two  distributions  are 
used,  the  normal  and  the  student  t.  Both  of  these  are  continuous  and 
unbounded,  i.e.,  the  range  is  -<*>  to  +*>.  For  this  reason,  v«e  can  never  be 
100%  confident  that  our  interval  will  capture  the  population  parameter. 
However,  in  the  effort  to  place  as  much  "confidence"  as  possible  in  our 
estimate,  we  use  small  values  of  a. 

The  confidence  interval  on  the  population  mean  is  given  by  the 
following: 


X  -  Z 


<  U  <  X  +  z 


a  J — 
2  /n 


Where 


m  sample  mean 

»  known  population  standard  deviation 
a  sample  size 

a  value  of  the  standard  normal  deviate  for  a 
specified  level  of  y. 


This  formulation  assumes: 


1.  a  is  known. 


2.  The  sample  is  luge,  i.e.,  n  >  30  or  the  sample  was  drawn  from  a 
normal  population. 

If,  however,  o  is  not  known,  the  value  of  Z  is  replaced  with  the  appro¬ 
priate  value  of  the  student  t  with  n-1  degrees  of  freedom.  The  value  of  s 
is  used  to  estimate  the  unknown  value  of  o.  The  (1  -  o) *100%  confidence 
interval  then  becomes: 


X  -  t 


f  * 


<  y  <  X  +  t  • 

•  “  fl 


where  t  is  the  value  from  the  student  t  distribution  with  n-1  df 

CL 


such  that  P(t  >  t  ) 
a 


.  As  in  the  confidence  interval  for  y  where  a  is 


known  r-  is  used  to  indicate  a  "two-sided"  interval.  In  the  case  where  a 


"one-sided"  confidence  interval  ia  desired,  use: 


X  - 


t 

a 


y  <  «* 


upper  or  right-side  interval 


or 


x  +  ta 


lower  or  left-side  interval 


The  probability  statement  for  a  lower  confidence  interval  for  y  can  then  be 
constructed  as: 


P(y  <  X  +  t  ’  — —  )  -  1  -  o 

®  /r 


To  illustrate r  consider- the  case  where  contract  specifications  require 
that  the  tensile  strength  of  a  certain  item  on  the  average  be  no  less  than 
200  psi.  Since  destructive  testing  is  costly,  the  quality  control  engineer 
decides  to  test  a  small  sample  of  IS  items  and  obtains  the  following  results. 


X  -  254  psi 


n  ■  15  items 


3  *  24  psi 


t  2.624  (degrees  of  freedom  -  14) 

<1  ■  eUA 


Be  constructs  a  one-sided  confidence  interval  (upper)  with  a  confidence  level 
(1  -  a)  ■  .99  in  the  following  manner,  assuming  <7  is  unknown. 


99%  Cl 


254  -  2.624 


237.7  <  y  <  - 


The  engineer  can  be  99%  confident  that  the  population  of  items  produced  has 
an  average  tensile  strength  greater  than  237.7  psi.  In  other  words,  he  can 
be  fairly  certain  the  itsnm  are  exceeding  the  tensile  strength  specifications 
called  for  in  the  contract. 


Finite  Population  Sampling.  When  sampling  from  a  finite  population,  it  c»p 

-  gZ 

be  shown  that  Var  (X)  +  -  ,  therefore,  the  use  of  a  correction  factor  is 

n 

required.  The  variance  of  the  same  mean  then  becomes: 


Var  (X) 


where  N  ■  size  of  the  population 
n  ■  size  of  the  sample 


It  can  be  readily  seen  that  when  the  entire  population  is  included  in  the 
sample,  the  variance  of  the  sample  mean  is  zero  because: 


Var  (X) 


te)-< 


This  is  consistent  with  sampling  theory  in  that  when  the  entire  population 
is  included  in  the  sample,  the  sample  mean  X  is  really  the  population  mean  y, 
which  has  no  variance,  i.e.,  it  is  a  constant.  It  should  also  be  carefully 
noted  that  the  variance  of  the  sample  mean  gets  smaller  and  smaller  as  the 
sample  size  n  approaches  the  population  size  of  a  finite  population,  or 
when  the  sample  size  gets  extremely  large  from  an  infinite  population.  So, 


as  n 


■N,  then 


(finite  population) 


and  n 


",  then 


0  (infinite  population) 


Sample  Size  Determination,  it  is  often  desirable  to  develop  a  confidence 
interval  at  some  specified  width.  For  example,  the  civil  engineer  may  wish 
to  insure  that  the  average  weight  for  bags  of  cement  delivered  to  his  facility 
does  not  vary  from  y  by  more  than  five  pounds.  Hew  large  a  sample  must  be 
weighed?  His  specified  conditions  state  that  the  maximum  deviation  E,  or  the 
allowable  error  of  X  from  y  is: 


E  -  X  -  y  -  5 

The  engineer  wishes  to  be  95%  confident  that  X  does  not  differ  from  y  my 
more  than  5  pounds.  For  this  level  of  confidence  then 

E  -  5  -  1.96o 


Assuming  the  weights  of  the  bags  of  cement  are  normally  distributed  and  o 
is  known  to  be  10  pounds: 


5  -  1.96 


The  sample  size  is  found  by  solving  the  equation  for  n. 


Thus,  to  insure  95%  confidence  that  the  sample  mean  will  not  differ  from  the 
true  population  mean  by  more  than  five  pounds,  the  civil  engineer  will  have 
to  weigh  154  bags  of  cement. 

The  general  expression  for  determining  sample  size  is  given  by 

-  fz  •  rt  1  2 


where 


n  "  required  sample  size 

Z  -  value  of  the  standard  normal  variable  for  a  confidence 
T’  level  of  (1  -  a)  •  100% 


c  *  known  standard  deviation  of  the  population. 

It  a  is  unknown,  a  pilot  sample  should  be  taken 
and  s  computed  as  an  estimator 

E  -  maximum  allowable  deviation  from  y,  or  the 
confidence  interval  half-width. 


In  a  similar  manner,  the  appropriate  sample  size  can  be  computed  when  a 
desired  level  of  precision  is  necessary  in  estimating  a  population  proportion. 


n  *>  ir(l-ir) 


It  cam  be  seen  that  the  maximum  sample  size  will  occur  when  a  value  of  0.50 
is  used  for  ir. 


Confidence  Intervals  for  Variances .  In  a  maimer  similar  to  that  used  for 
developing  interval  estimates  on  y,  confidence  intervals  can  be  constructed 
to  estimate  population  variances, a2.  Since  the  sampling  distribution  for 

variance  is  x^»  and  “  - - — ,  then  a  two-sided  confidence  interval 

c2 

for  o2  can  be  expressed  as: 


(n  -  l)s2 


<  a*  < 


(n  -  l)s' 
X?  * 

*V'  2“ 


where  s2 


.sample  variance 


K.  l-f 


upper  value  of  the  x2  distribution  with  v 
degrees  of  freedom 


X2  ° 
V'  2 


lower  value  of  the  x2  distribution  with  v 
degrees  of  freedom 
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.a. !■  n- 


For  example,  a  loadmaster  wished  to  be  95%  confident  of  the  variance 
in  the  weight  of  0-141  cargo  and  obtained  the  following  information  from 
a  sample . 

n  ■  25  s2  ■  18  (hundreds  of  pounds)2 

The  95%  con fi den©  interval  would  be: 

(25  -  1)  (18)  .  _  _  (25  -  1)  (18) 

- 397 4 - -  a  -  - IO - 


or  10.96  <  o“  <  34.84 


Similarly,  to  develop  one-sided  confidence  intervals,  the  following  can  be 
used: 


For  the  lower  (left)  side: 


0  <  o2  <  <3— 


(Note  that  —  is  not  used) 


For  the  upper  (right)  side: 


(n  -  l)s' 


<  o2  <  « 


V,  1  -  o 


In  the  previous  example,  the  two  one-sided  intervals  would  be: 


I~or,  ,2  <  <«  ZSLiiSl  or  ,2  <  3J.J 


uppor:  o2  »  fl8)  or  c2  I  11.87 


the  appropriate  val\*es  of  the  x2  distribution  are  obtained  from  the  x2 
table  by  finding  the  degrees  of  freedom  and  specifying  the  level  of  confidence 
desired. 


Figure  8-2  graphically  illustrates  the  relationships  among  the  inter¬ 
vals  computed  above. 


Figure  8-2 


Type  of 
Interval 


TWo-sided 

One-sided: 

Lower 


tipper 
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It  will  be  shown  later  that  confidence  intervals  are  useful  in  the  testing 
of  hypotheses,  a  subject  to  which  we  new  turn. 


Statement  of  Hypothesis 

As  indicated  at  the  beginning  of  the  chapter,  a  hypothesis  is  a  state¬ 
ment  about  a  population  parameter.  In  hypothesis  testing,  two  hypotheses 
are  constructed.  The  first  is  called  the  null  hypothesis  which  represents 
a  claim  or  a  statement  which  we  wish  to  refute.  The  other  is  called  the 
alternate  hypothesis  which  is  a  statement  which  we  desire  to  support  with 
the  evidence  supplied  through  a  sample.  It  is  important  to  note  that 
hypotheses  are  always-  constructed  on  population  parameters,  not  sample 
statistics . 

To  illustrate,  consider  the  case  of  a  supply  officer  who  must  accept  a 
shipment  of  repair  parts  from  a  supplier  unless  it  can  be  shewn  that  the 
parts  do  not  meet  specifications,  suppose  the  specifications  require  that 
the  average  weight  of  the  parts  does  not  exceed  25  pounds.  The  supply 
officer  would  construct  the  following  hypotheses. 

Null  hypothesis  —  Hc:  |i  <,  25  lbs 

Alternate  hypothesis  —  H, :  y  >  25  lbs 


Under  the  circumstances ,  the  supply  officer  will  accept  the  shipment 
of  repair  parts  unless  he/she  can  show  that  a  sample  of  repair  parts 
weighs  significantly  more  than  25  pounds.  In  other  words,  he/she  must  be 
able  to  reject  the  null  hypothesis  before  he/she  can  reject  the  shipment. 

By  rejecting  the  null  hypothesis,  he/she  is  stating  that  he/she  has  sample 
evidence  to  support  the  alternate  hypothesis.  If  he/she  cannot  reject  the 
null  hypothesis,  it  is  because  the  sample  data  are  not  sufficient  to  show 
that  the  specifications  are  not  met. 

So,  in  any  test  of  hypotheses,  a  "strong"  conclusion  is  reached  only 
when  the  null  hypothesis  can  be  rejected.  If  Ho  cannot  be  rejected,  the 
only  conclusion  which  can  be  reached  is  that  the  sample  data  do  not 
support  the  alternate  hypothesis.  This  is  known  as  the  "weak”  conclusion; 
weak  because  it  is  a  fall-back  conclusion,  i.e.,  the  sample  evidence  can¬ 
not  support  the  contrary. 

Level  of  Risk 

It  is  possible  that  our  supply  officer  will  reject  the  null  hypothesis 
and,  consequently,  the  shipment  of  repair  parts  erroneously.  This  could  be 
costly  both  in  a  monetary  sense  (contractor  sues  because  the  parts  do  meet 
specifications)  and  in  mission  sense  (parts  are  needed  to  fill  Not 
Operationally  Ready — Supply  requisitions) .  To  preclude  this  from  happening 
too  frequently,  a  level  of  significance  is  established.  The  level  of 
significance,  called  a,  is  set  at  a  level  which  corresponds  to  the  conse¬ 
quences  which  will  be  incurred  if  the  null  hypothesis  is  rejected  when  it 
is  in  fact  true.  When  the  effect  of  a  wrong  decision  may  involve  human 
lives,  a  might  be  set  as  low  as  .0001.  On  the  other  hand,  a  hypothesis 
test  involving  the  behavioral  patterns  of  experimental  rabbits,  an 
acceptable  level  of  a  could  be  .25.  In  any  event,  the  logistician  must  be 
cognizant  of  the  consequences  (costs)  involved  in  making  a  wrong  decision. 

In  probabilistic  terms,  the  level  of  significance  a,  can  be  stated  as 
the  probability  that  Ho  will  be  rejected  when  it  is  true,  a  then  is 
referred  to  as  the  level  of  significance  of  the  test.* 


Conducting  Testa  of  Hypotheses 

Four  tests  of  hypotheses  are  discussed  including  the  t  tests  on  means 
and  proportions,  and  the  chi-square  and  F  tests  on  variance.  Bach  test 
performed  can  be  classified  as  a  one* or  two-tailed  test.  For  a  one- 
tailed  test,  the  experimenter  has  a  priori  knowledge  about  the  direction 
of  the  difference  to  be  ascertained  through  the  sample.  This  is 
demonstrated  in  the  following  example. 

A  supplier  of  steel  rods  to  the  Air  Force  claims  that  their  average 
tensile  strength  is  750  pounds.  To  test  this  claim,  the  Air  Force  using 
activity  would  establish  the  following  hypotheses! 

Hqs  p  >  750  pounds 

Hi:  p  <  750  pounds 


If  the  Air  Force  can  show  through  a  sample  of  rods  that  the  tensile 
strength  is  significantly  less  than  750  pounds,  i.e. ,  reject  Hq,  then  the 

supplier's  claim  has  been  refuted  and  shipments  of  the  rods  can  be 
refused. 

In  the  same  example,  the  supplier  would  establish  the  following 
hypotheses : 


Hqs  p  <  750  pounds 
Hjs  p  >  750  pounds 

In  this  case,  the  supplier  would  like  to  show  that  the  rods  have  a 
tensile  strength  of  at  least  750  pounds.  Obviously  with  same  sample,  only 
one  rejection  of  H0  can  occur.  The  first  test  is  known  as  a  one-tailed 
test  to  the  left,  identified  by  the  direction  of  the  inequality  of  the 
alternate  hypothesis.  The  strong  conclusion  is  reached  by  rejecting  the 
null  hypothesis  and  accepting  the  alternate.  This  conclusion  is  supported 
by  sample  data. 

In  the  second  case,  a  one-tailed  test  to  the  right,  is  one  in  which 
the  supplier  would  like  to  reject  H0,  i.e.,  support  his  claim  with  sample 
evidence. 

A  third  situation  may  exist  in  which  the  direction  of  the  test  is 
unimportant  or  the  experimenter  has  no  a  priori  knowledge  of  the  direction 
of  the  difference  being  tested. 

For  example,  consider  the  case  of  the  diameter  of  ball  bearings.  To 
work  properly,  they  can  neither  be  too  large  nor  too  small.  TO  test,  the 
following  two-tailed  hypotheses  would  be  established: 

HQ:  p  -  15  am 

Hj:  p  ?  15  mm 

Here,  specifications  would  state  that  the  diameter  of  ball  bearings 
must  be  15  millimeters .  Rejection  of  Hq,  i.e.,  acceptance  of  the  strong 
conclusion  would  occur  if  a  aasple  of  ball  bearings  had  average  diameters 
either  significantly  less  than  or  significantly  greater  than  15  ns. 

t  Tests  on  Means  (one  sample) 

TVo- tailed  Test:  H0:  p  «  pQ  One-tailed  Test:  Hq:  v  <  Vq 

His  p  ?  Uq  hj:  V  >  U0 


u  is  a  value  to  be  tested 


or 


Hol  v  l  »o 

Hj:  p  <  pe 


Assumptions : 


1.  Random  sample 

2.  Normal  population  or  sampla  sirs  >  25 

3.  o2  unknown 


Test  statistic:* 


where  X  *  sampla  mean 

Uq  ■  hypothesised  value  to  be  tested 
s  -  sample  standard  deviation 
n  ■  sample  ai.se 


Decision  rule:  Reject  Hq  if  t  is  more  extreme  than  te  ,  a  critical  value 
of  t  determined  by  the  level  of  a  chosen  and  degrees  of  freedom  of  the 
sample  (n  -  1) . 

•When  a2  is  known,  the  standard  normal  variable  is  used  as  the  test  statistic 
where 


Z 


.  *  -  **0 


s 


z  is  compared  with  zc  ,  from  the  standard  normal  table  at  the  specified  level 


Casa  1*  Pal rad  Data— nonindapandant  sanplas,  sonatinas  rafarrad  to  as  tha 
"bafora  and  a£tar"  tast. 

Assumptions: 

1.  Paired  data  (ralatad  sanplas) 

2.  o2  unknown 

3.  a2  tannins  constant  for  both  sanplas. 

4.  Nomal  population  or  saapls  sisa  >  25. 


Test  statistic: 


Let  ■  ^21 


then  t  ■ 

SD 


where  s^ 


and  sD  ■ 


and  t  has  n-1  degrees  of  freedom. 


Example.  This  test  could  be  used  to  measure  the  effectiveness  of  a  training 
program  where  the  same  individuals  are  tested  before  training  and 
after  training.  A  significant  improvement  in  test  scores  after 
training  would  indicate  that  the  program  was  effective. 

Case  2:  Independent  Samples 
Assumptions : 

1.  Independent  samples 

2.  Normal  populations  or  sample  sizes  >  25 

3.  a j  and  o|  known 
Test  statistic: 


(Xj  -  X2) 


This  test  along  with  Cases  3  and  4  can  be  used  to  determine  whether 
two  independent  samples  were  drawn  from  the  same  population,  i.e. ,  with  the 


and  t*  has  an  approximate  t  distribution  with 


degrees  of  freedom 


t  tests  on  Proportions 
One  sample: 

TVo- tailed  test:  HQ:  »  ■  irQ  One-tailed  test:  Hg:  ir  5  irQ 

®1 !  w  *  *o  Hls  *  >*Q 

or  Hg:  w  2  *0 

Hl*  w  <  * 

o 


Test  statistic: 


P  -  »o 

o 

P 


where 

p  -  sample  proportion  or  the  number  in  the  sample  having  a 
specific  attribute  divided  by  the  total  number  of 
observations  in  the  sample 

*0  -  the  hypothesized  proportion  to  be  tested 

0^  -  standard  error  of  the  proportion  which  can  be  found  by 


Ihis  test  statistic  should  be  used  only  when  the  sample  is  large,  i.e. 

when  irQ  •  n  2  25.  Otherwise  the  binomial  distribution  should  be  used  to 

test  the  hypothesis. 

Example .  A  clothing  sales  outlet  has  found  that  15%  of  new  short-sleeved 
shirts  are  returned  by  AF  customers  because  they  are  missized  by 
the  manufacturer.  The  maker  was  advised  of  the  problem  and 
subsequently  developed  a  new  sizing  process.  Of  the  next  batch 
of  500  shirts  sold,  60  were  returned  because  of  missizing.  Was 
the  new  process  effective  in  reducing  the  number  of  returns? 

H0!  IT  >  .15 

His  it  <  .15 

test  at  a  *  .05 


Reject  H  since  Z  <  Z  and  accept  Hi  ,  the  new  process  significantly 
o  c 

reduced  the  proportion  of  defectives  (returns  for  missizing) . 

1 


Tests  on  Variances  (On «  Sample) 

Two-tailed  test:  H  :  a 2  -  a2  One-tailed  test:  H_:  o2  <  <j2 

o  V'  O  —  o 

Hj:  o2  jt  a2  Hj:  a2  >  aj 

or  HQ:  a2  2  a2 

O 

Hj:  a2  <  a2 
-1  o 


Assumption:  All  tests  of  variances  assume  that  samples  have  been  drawn 
from  normal  or  near  normal  populations. 


Test  statistic:  One  sample  tests  of  variance  involve  use  of  the  Chi-square 
statistic. 


where. 


o 


where  s2  »  sample  variance 
n  *  sample  size 

c2  *  hypothesized  value  of  the  variance  to  be  tested 
The  value,  test  statistic,  has  n-1  degrees  of  freedom. 


Decision  rule:  Reject  HQ  if  x£  >  x2  •  where  x2  is  the  critical  value  of  the 
X2  with  n-1  degrees  of  freedom  at  a  specified  level  of  a, 
obtain. d  from  a  table  of  x2  values. 

Example .  The  base  supply  officer  will  discontinue  doing  business  with  a 
supplier  whose  delivery  schedule  varies  by  more  than  10  (days)2, 

i.e.,  c2  >  10.  Re  records  the  delivery  times  of  Supplier  A  for 

the  next  25  deliveries  and  finds  s2  -  18,  i.e., 

I  (X.  -  X) 2 
iml  — —  - le- 

where  X^  ■  number  of  days  to  deliver  the  1th  order 
n  ■  number  of  orders 

X  -  average  number  of  days  to  deliver  an  order 
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T 


Y" 


i 


I 

1 

1 

( 

$ 


The  test  is  performed  at  a  ■  .05 


H0:  c2  <  10 


Hji  O2  >  10 


(18)  (25-1) 

10  “43*2 


X*  -  36.4 


Reject  Hq,  >  X2  *nd  conclude  the  variability  of  delivery  tines  from 
Supplier  A  exceeds  10  (days)2. 

An  alternate  method  for  computing  a  test  statistic  can  be  used  to  find 

the  largest  acceptable  value  of  s2,  which  can  be  called  s2  ,  where 

c 


where  x2  is  the  critical  value  of  x2  with  n-1  degrees  of  freedom  at  a 

specified  level  of  a.  In  this  cue,  reject  H.  when  s2  >  s2. 

c 

In  the  example. 


s2 

c 


(36.4) (10)  ..  _ 

(25  -  1)  15,2 


Since  s2  >  s2 
c 


reject  Hq,  the  conclusion  is  the  same  as  above. 
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Tuts  of  Variance  (Pro  Sample) 


Recalling  the  t  teats  on  differences  of  means  of  two  independent 
samples ,  it  is  necessary  in  deciding  the  appropriate  cue  to  apply,  to 
determine  whether  unknown  variances  are  equal  (Cue  3)  or  unequal  (Cue  4) . 
The  appropriate  procedure  is  to  perform  an  hypothesis  test  on  the  sample 
variances  in  question.  To  do  this,  an  F  test  is  applied.  Again,  it  must 
be  usumed  that  the  independent  samples  are  drawn  from  normal  or  near 
normal  populations. 


TWo-tailed  tests: 


"o1 


■i*  °i 


2 

2  »*  *2 


One  tailed  test:  H0:  of  <  <j f 


Hr  >  °2 


or 


H0:  of  >  of 


H, :  O?  <  O? 


Test  Statistic: 


*2 


where  sf  -  variance  computed  from  the  sample  with  the  largest 
variance 

sf  -  variance  from  the  sample  with  the  smallest  variance 

F  hu  a  distribution  with  nj  -  1  degrees  of  freedom  in  the  numerator 
and  n2  - 1  degrees  of  freedom  in  the  denominator. 

Decision  rule:  Reject  HQ  if  C^nl  ^  >  p ^nl  ^ 

<n2-l)  c(n2-2) 

Example.  The  weather  service  is  testing  the  variance  of  bursting  strength 
of  balloons  from  two  competitors  and  has  gathered  the  following 
data. 


Company  A 


Company  B 


n 


A 


21 


s^  ■  2.5  psi 


"B  "  16 

Sg  -  1.5  psi 
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Is  there  a  significant  difference  in  the  variance  of  bursting  strength 
between  the  two  companies'  balloons?  The  test  is  to  be  performed  at  a  -  .02. 


v  i  -  %2 
v  tf  *  i 


(2.5)  2 

(1.5)  2 


2.778 


C  (21-1) 
r°  (16-1) , 


.01 


3.37 


F  /  Fc  ,  do  not  reject  H0.  Therefore  we  cannot  conclude  there  is  a 

significant  difference  in  the  variances  of  bursting  strength  between  the 
two  companies'  balloons.  Note  that  in  a  two-tailed  test,  the  value  of  a 
is  halved. 


w% 


In  summary,  to  conduct  tests  of  hypotheses,  the  following  methodology  is 
appropriate : 

1.  State  the  hypotheses.  It  is  often  best  to  establish  the  alternate 
hypothesis  first.  H.  normally  contains  a  statement  of  that  which  is  to 
"proved,"  or  supported  by  simple  data.  The  null  hypothesis  then  includes 
all  other  possibilities.  In  many  cases,  the  null  hypothesis  is  a  manu¬ 
facturer's  claim.  Here  it  is  assumed  that  the  claim  will  be  accepted  unless 
it  can  be  shewn  (by  sample  evidence)  that  it  is  not  true.  Decide  whether 
the  test  is  to  be  one-  or  two-tailed  . 


2.  Set  the  level  of  significance.  What  are  the  costs  of  being  wrong, 
i.e.,  rejecting  a  true  hypothesis?  For  two-tailed  tests,  use  a  . 

2 

3.  Select  the  appropriate  test  statistic.  The  choice  of  the  test 
statistic  will  depend  upon  the  statement  made  in  the  null  hypothesis. 

4.  Select  the  sample  and  compute  the  test  statistic. 

5 .  Compare  the  computed  value  with  the  critical  value  obtained  from 
the  appropriate  statistical  table. 

6.  Reject  H»  if  the  test  statistic  is  more  extreme  than  the  critical 
value .  Otherwise,  do  not  reject  Hq. 

7.  Draw  the  conclusion.  State  what  has  been  accomplished  by  rejecting 
(or  not  rejecting)  the  null  hypothesis. 
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CHAPTER  9 


INDEX  NUMBERS 


Index  numbers  are  ratios,  usually  expressed  as  percentages,  indicating 
changes  in  values,  quantities  or  prices.  Typically,  the  changes  are  measured 
over  time,  each  index  being  compared  with  the  corresponding  index  from  some 
selected  base  period.  Simple  index  numbers  deal  with  homogeneous  denominations 
representing  commodities  such  as  plywood,  steel  or  grain.  More  commonly, 
though,  index  numbers  are  aggregates  of  a  number  of  different  commodities, 
products  or  services.  Each  item  in  the  aggregation  is  weighted  in  proportion 
to  its  amount  in  a  particular  end  item,  industry  or  geographical  region. 

Index  nuntoers  are  commonly  classified  into  three  different  types:  price, 
quantity,  and  value.  A  series  of  price  index  numbers  represent  changes  in 
prices  of  items  or  commodities  over  time.  An  example  of  a  price  index  number 
series  is  the  Wholesale  Price  Index  (published  monthly  by  the  Bureau  of  Labor 
Statistics)  which  represents  the  change,  over  time,  in  the  average  wholesale 
prices  of  commodities  and  products  sold  in  the  United  States.  A  quantity  index 
measures  the  change  in  the  amount  of  a  commodity  or  product  produced  over 
periods  of  time.  The  Federal  Reserve  Board  compiles  a  quantity  index  called 
the  Index  of  Industrial  Production  which  measures  physical  volume  of  factory 
production  in  the  united  States  from  one  yetr  to  the  next.  A  value  index 
combines  changes  in  both  price  and  quantity  over  time.  Value  indexes  can  be 
considered  to  be  the  product  of  a  price  index  and  quantity  index.  A  commonly 
used  value  index  is  the  Index  of  Retail  Sales  published  in  the  Federal  Reserve 
Bulletin  which  reflects  the  changes  in  both  the  prices  and  the  quantities  of 
items  sold  by  retail  sales  outlets  across  the  United  States. 

When  dealing  with  index  numbers,  it  is  important  to  identify  the  type  of 
index  number  one  is  working  with  and  use  that  type  of  index  number  consistently 
throughout  the  analysis.  In  the  following  discussion  of  the  different  tech¬ 
niques  used  to  construct  index  numbers,  the  equations  and  examples  shown  are 
for  price  indexes.  However,  a  simple  renaming  of  the  variables  would  make  the 
equations  appropriate  for  any  type  of  index  number  construction. 

Constructing  Price  Index  Numbers 

Price  index  numbers  indicate  price  changes  with  respect  to  time  for  some 
specific  commodity,  product  or  service.  As  historical  Indicators,  index  num¬ 
bers  become  more  accurate  if  they  are  constructed  using  actual  prices  paid  for 
a  particular  commodity,  product  or  service  rather  than  than  using  the  more 
general  aggregative  index  numbers  published  by  agencies  such  as  the  Bureau  of 
Labor  Statistics.  Accordingly,  this  section  will  treat  five  methods  of  price 
index  nixnber  construction  including  examples.  They  are:  simple  price  indexes, 
simple  aggregative  price  indexes,  weighted  aggregative  price  indexes  of  the 
Laspeyres  and  Paasche  types,  and  weighted  average  of  relative  indexes.  A  short 
discussion  of  selecting  the  base  period  is  included  before  presenting  the 
individual  methods  of  construction. 
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The  Base  Period 


Price  index  numbers  are  price  relatives,  usually  expressed  as  percentages. 
As  price  relatives,  they  relate  prices  paid  in  one  time  period  to  prices  paid 
in  some  base  time  period.  To  provide  comparability,  a  series  of  index  num¬ 
bers  representing  some  commodity,  product  or  service  is  always  constructed 
using  the  same  base  period,  thus  reflecting  a  percentage  increase  or  decrease 
in  prices  relative  to  that  base  period. 

0 

The  selection  of  a  base  period  is  usually  an  arbitrary  process.  On  a 
short  series  of  data,  say  five  to  ten  years,  the  analyst  often  chooses  the 
first  tearliest)  year  as  the  base  year.  Under  ideal  conditions,  it  is  best  to 
choose  as  a  base  year  a  year  in  which  prices  are  not  changing  erratically. 

This  is  difficult  when  hundreds  of  items-  are  included  in  an  aggregative  index 
number  and  leads  one  back  to  an  arbitrary  choice  of  a  base  year. 

A  shift  in  a  series  of  index  numbers  to  a  new  base  year  is  a  straight¬ 
forward  calculation  if  it  is  known  that  the  original  sample  of  goods  and 
services  remains  representative  of  the  price  changes  to  be  indicated.  For 
example ,  consider  the  problem  of  adjusting  a  1976  index  number  with  a  base 
year  of  1967  to  a  base  year  of  1972.  In  addition  to  the  value  of  the  1976 
number  (say  151.9)  one  needs  the  value  of  the  corresponding  number  in  the 
same  series  in  1972  (say  118.0) .  The  conversion  is  accomplished  by  dividing 
the  index  number  to  be  changed  by  its  corresponding  value  from  the  new  base 
year  and  expressing  the  result  as  percentage.  In  this  example,  the  1976  index 
number  with  a  new  base  year  of  1972  would  be  (100  X  151.9  *  118.0)  or  128.7. 

The  same  information  is  relayed  by  the  revised  index  number  series  as  the 
initial  series.  Only  the  base  year  has  been  changed. 

Simple  Price  Indexes 

A  simple  price  index  series  is  usually  a  time  series  of  price  relatives 
converted  to  percentages.  A  price  relative  in  this  case  is  the  average  price 
of  an  item  for  a  given  time  period  (e.g.,  a  given  year)  divided  by  the  average 
price  of  an  item  for  the  base  time  period. 

In  constructing  price  index  numbers,  it  is  important  to  express  a  price 
in  dollars  per  measure  of  quantity  (e.g.,  $/#,  $/person  or  $/ft. 2) .  These 
measures  of  quantity  are  used  in  the  construction  of  the  weighting  factors 
for  weighted  aggregative  index  numbers.  One  should  not  express  the  price  in 
terms  of  dollars  per  period  such  as  it  found  in  accounting  data.  Accounting 
data  need  to  be  edited  to  dollars  per  measure  of  quantity  before  their  use  in 
price  index  number  construction. 

There  are  four  steps  to  be  followed  in  constructing  a  simple  price  index 
number.  They  are:  (1)  Collect  or  develop  average  time  series  price  data 
for  the  product,  commodity  or  service  to  be  analyzed.  For  example,  assume 
the  average  yearly  prices  of  h  inch  thick,  4  feet  by  8  feet,  grade  AC,  inte¬ 
rior,  sanded  plywood  per  1000  square  feet  are: 


Year  1976  1969  1971  1973  1975 

Price  $84.12  $95.06  $107.32  $121.35  $152.72 

(2)  Select  a  base  year,  say  1967;  (3)  Calculate  a  tine  series  price  relative 
for  each  year  by  dividing  each  price  by  the  base  year  price;  and  (4)  Convert 
the  series  to  an  index  number  of  percentage. 

Year  1967  1969  1971  1973  1975 

Price 

Relative  1.000  1.130  1.276  1.443  1.816 

Index 

Number  100.0  113.0  127.6  144.3  181.6 

These  index  numbers  indicate  the  percentage  change  in  price  with  respect 
to  the  base  year  only.  The  index  for  1971  of  127.6  indicates  that  the 
average  price  of  plywood  went  up  27.6  percent  with  respect  to  1967.  It  does 
not  indicate  that  the  average  increase  was  14.6  percent  (127.6  -  11.30)  with 
respect  to  1969.  To  calculate  the  percentage  increase  in  price  for  1971 
with  respect  to  1969  one  would  divide  the  1969  index  into  the  1971  index, 
multiply  the  dividend  by  100  and  subtract  100.0. 

100  [(127.6  *  113.0)]  -  100.0  or  12.9  percent 

Mathematical  notation  is  useful  in  expressing  different  methods  of  index 
number  construction.  For  constructing  simple  price  indexes,  the  notation  is 


where  j  is  a  subscript  denoting  different  periods  ranging  from  o  to  n.  In 
this  instance  Pc  is  the  price  during  the  base  year  0.  are  the  price  values 
for  the  different  years,  e.g.,  PQ  ■  $84.12  and  P£  ”  $95.06.  Ij  are  the  indexes 
for  the  different  years,  e.g.,  I0  ■  100.0,  and  I2  ■  113.0. 

Seldom  will  a  single  simple  index  number  suffice  for  pricing  purposes. 

Most  items  that  are  purchased  are  made  up  of  many  different  materials  and 
types  of  labor,  the  prices  of  which  vary  at  different  rates  as  time  proceeds. 
Therefore,  the  analyst  must  construct  composite  index  numbers  that  reflect 
aggregative  changes  in  the  prices  of  the  components,  assemblies,  and  types  of 
labor  that  make  up  an  item.  This  need  has  been  satisfied  by  developing  a 
number  of  different  methods  for  constructing  aggregative  indexes.  The 
easiest  of  these  methods  is  the  simple  average  of  relatives  approach. 
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Average  of  Relatives  Price  Indexes 


Simple  average  of  relatives  price  indexes  are  constructed  to  indicate 
time  series  changes  in  prices  of  more  than  one  item  or  commodity.  Consider 
the  construction  of  wooden  boxes  from  plywood  and  nails.  One  can  construct 
simple  price  indexes  for  nails  in  the  same  fashion  that  the  indexes  for  ply¬ 
wood  were  constructed  in  a  previous  example. 


Year 

1967 

1969 

1971 

1973 

1975 

Price 
per  keg 

$15.20 

$15.65 

$16.00 

$15.92 

$18. 10 

Price 

relative 

1.000 

1.030 

1.053 

1.047 

1.191 

Index 

number 

100.0 

103.0 

105.3 

104.7 

119.1 

To  construct  a  simple  average  of  relatives  price  index  reflecting  an 
aggregative  price  change  for  plywood  and  nails ,  one  sums  the  indexes  for  each 
commodity  during  a  year  and  divides  by  the  number  of  commodities.  In  this 
instance,  with  only  two  indexes,  the  calculations  are  easy.  For  example,  in 
1969  the  index  is  (113.0  +103.0)  +2  or  108.0. 

Year  1967  1969  1971  1973  1975 

Simple 
Aggregative 
Price  Index 
for  Plywood 

&  Nails  100.0  108.0  116.5  124.5  150.4 

Continuing  the  practice  of  writing  equations  for  index  number  construction, 
the  general  relationship  for  the  simple  average  of  relative  price  index  is: 


where  i  *  1  to  m  and  relates  to  the  different  commodities, 
products  or  services  that  make  up  the  Index  I. 

and  j  ■  0  to  n  and  relates  to  the  different  periods  from  the 
first  period  0  to  period  n. 

An  advantage  of  the  simple  average  of  relatives  price  index  is  that  it 
permits  the  user  to  mix  products  with  different  dimensions.  In  this  illustra¬ 
tion  one  product  with  dimensions  of } 000  ft2  is  mixed  with  another  product  with 
dimensions  of  kegs.  This  is  done  by  reducing  each  product  to  a  dimensionless 
ratio  before  combining. 
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A  disadvantage  of  the  simple  aggretative  pricing  index  method  is  that  it 
implicitly  assigns  weights  to  each  of  the  procucts  or  coasnodities.  Thus ,  in 
the  illustration,  1000  ft2  of  plywood  has  the  same  weight  as  one  keg  of  nails. 
This  weighting  would  be  acceptable  only  if  in  constructing  boxes,  one  keg  of 
nails  was  needed  for  each  1000  ft2  of  plywood  used.  Seldom  would  that  circum¬ 
stance  occur.  Thus,  there  is  a  need  to  weight  each  price  relative  in  con¬ 
structing  an  aggregative  index  number. 

Choosing  Weighting  Factors 

The  quantities  of  each  item  or  product  in  an  aggregative  index  number  are 
the  logical  candidates  for  explicit  weighting  factors.  There  is  much  dis¬ 
cussion  in  the  literature  concerning  which  quantities  to  use,  base  year,  cur¬ 
rent  year  or  some  mixture  of  each. 

Base  year  quantities  are  the  quantities  of  each  item  in  the  index  purchased 
in  the  base  year.  In  base  year  quantity  construction,  these  quantities  would 
be  used  to  weight  price  relatives  for  all  years  of  the  time  series,  thus 
eliminating  the  effects  of  quantity  changes  from  one  year  to  another  on  the 
price  index  number.  This  approach  does  not  recognize  that  in  the  real  world 
the  mix  of  items  being  purchased  often  changes  from  one  year  to  another.  Base 
year  weighting  is  used  in  constructing  price  relatives  of  the  Laspeyres  type. 
Another  approach,  current  year  weighting,  is  used  in  constructing  aggregative 
price  indexes  of  the  Paasche  type. 

Weighted  Aggregative  Price  Indexes  of  the  Laspeyres  and  the  Paasche  Types 

A  common  approach  for  index  number  construction  is  the  fixed  base  year 
weighting  approach  developed  by  Etieene  Laspeyres  in  1864.  In  addition  to 
collecting  time  series  price  data  for  the  different  items  to  be  aggregated, 
the  approach  requires  collection  of  quantity  data  for  the  base  year  selected. 
The  formulation  of  the  Laspeyres  methods  is: 


"  m 

I 

i-1 

pu 

2io 

_ III 

pio 

Qio 

Where  i  stands  for  the  different  items  numbered  from  1  to  m  and  j  stands  for 
the  different  periods r  j~o  is  the  base  period  or  year,  thus  P,  stands  for 

til 

the  base  year  price  of  the  i  commodity  and  Q.  stands  for  the  base  year 

til  ^ 

quantity  of  the  i  conmodity . 

Extending  the  plywood  and  nails  example,  one  needs  to  collect  quantity 
data  as  well  as  price  data.  Summarizing  the  data  again: 
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Year 

1967 

1969 

1971 

1973 

1975 

Plywood  Pr^— 8 

$84. 12 

$95.06 

$107.32 

$121.35 

$152.72 

Plywood  Qty.  N 

1000 

1025 

1025 

1010 

1010 

UK  *5ie* 
keg 

$15.20 

$15.65 

$16.00 

$15.92 

$18.10 

Nails  Qty .  Kegs 

102 

104 

105 

103 

99 

Note  that  the  quantities  are  expressed  in  the  sane  dimensions  as  the  prices, 
e.g. ,  plywood  in- 1000  ft2  or  M  and  nails  in  kegs.  For  the  Laspeyres  method  of 
construction,  only  the  base  year  Cl 967)  quantities  will  be  used  for  weighting 
purposes.  For  example: 

2 

Pi,1975  Ci,1967 

~2 

J  Pi,1967  Ci,1967\ 

J 

[($152.72X1000)  +  ($18.10)  (100)3 1  .  1Q0  . 
[($  84.12) (1000)  +  ($15.20) (100) JJ 

All  calculations  using  the  Laspeyres  method  of  index  number  construction  use 
base  year  quantities  for  weighting  nunibers.  In  a  similar  fashion,  the  rest 
of  the  index  numbers  calculated  by  the  Laspeyres  method  are: 

Year  1967  1969  1971  1973  1975 

Index  100.0  112.8  127.2  143.6  180.4 


1975 


-  100 


»  100 


In  1874,  H.  Paasche  presented  another  approach  for  constructing  index 
numbers  which  uses  current  period  quantities  as  weights.  The  formulation  of 
the  Paasche  method  is:  _  _ 


N 


100 


l  Pij 
i-1  13 


m 

I 
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I  Pio 


with  the  same  meaning  ascribed  to  the  subscripts  as  previously  defined.  This 
index,  like  the  Laspeyres  index,  is  a  ratio  of  weighted  aggregates.  But  it 
relates  the  sum  of  current  prices  times  current  quantites  to  a  hypothetical 
sum  of  base  year  prices  times  current  quantities.  Using  the  sample  data  for 
an  illustration  again: 
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In  a  similar  fashion,  the  rest  of  the  index  numbers  calculated  by  Paasche  are: 

Year  1967  1969  1971  1973  1975 

Index  100  112.8  127.2  143.5  180.5 

In  the  illustration,  there  was  insufficient  variation  in  the  quantities  pur¬ 
chased  between  the  years  to  produce  a  significant  change  in  the  index  numbers 
calculated.  Nevertheless,  when  there  are  many  items  in  the  calculations,  and 
the  variation  in  the  quantities  being  purchased  is  significant,  the  different 
approaches  will  produce  different  results. 

The  Paasche  and  Laspeyres  weighted  aggregative  indexes  are  superior  to  the 
simple  indices  previously  discussed.  By  the  weighting  approach,  they  are  freed 
from  problems  of  distortion  associated  with  the  size  of  the  units  for  which  the 
prices  are  quoted.  The  Laspeyres  index  is  essentially  the  same  one  used  by  the 
Bureau  of  Labor  Statistics  in  constructing  their  extensive  series  of  price 
indexes.  As  can  be  seen  from  the  differences  in  formulation,  the  data  gather¬ 
ing  task  is  simpler  for  the  Laspeyres  construction  since  only  base  year  quantity 
data  are  needed.  There  is  considerable  argument  in  the  literature  as  to  which, 
if  either,  of  these  two  approaches  is  better.  More  complicated  approaches 
have  been  devised  such  as  the  "Fisher”  ideal  index  to  try  to  eliminate  any 
bias  introduced  by  selecting  only  base  year  or  current  year  quantities  for 
weighting.  There  does  not  seem  to  be  any  conclusion  as  to  a  best  method  for 
index  number  construction. 

weighted  Average  of  Price  Relatives  Indexes 

Another  method  of  index  number  construction  using  a  weighted  average  of 
price  relatives  is  introduced  here  because  it  offers  a  practical  means  of 
combining  previously  constructed  index  numbers  and  specially  constructed  index 
numbers.  This  method  is  also  well  suited  to  using  contractor  accounting 
record  data  to  construct  the  indexes.  The  formulation  follows: 
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weighted  average  of  relatives  index 
number  for  given  time  period  j 
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where 


100 


th 

price  relative  of  the  i  commodity  or 
service  expressed  in  percentages;  or  a 
previously  calculated  index  number  from 
some  source  for  the  i^1  commodity  and  jth 
time  period 


i 


subscript  denoting  the  1th  commodity  or  service  and  ranging 
from  1  to  a 


j  «  subscript  denoting  the  time  period  and  ranging  from  the  base 
period  o  to  n 

the  relative  weight  of  the  i**  commodity 
service  expressed  as  a  relative  of  the 
total  of  all  the  oomnodities  or  services 
during  the  given  time  period 

P,  **  price  (expressed  in  dollars  per  some  unit  of  measure)  of  1th 
°  commodity  or  service  in  base  time  period  o 

P^  -  price  of  1th  commodity  or  service  in  j**  given  time  period 

th  th 

Q.  .  -  quantity  (unit  of  measure)  of  i  commodity  or  service  in  j 
3  given  time  period 
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The  approach  is  illustrated  in  Tables  9-1  through  9-5. 

The  illustration  shows  the  computation  of  an  employee  benefits  index  number 
for  the  years  1967,  1969,  and  1975  using  a  weighted  average  of  price 
relatives,  weighted  by  current  expenditures. 

One  can  observe  that  instead  of  constructing  a  price  relative  for  a 
particular  item  or  service,  one  could  substitute  a  previously  constructed 
index  number  for  1^.  in  the  formulation.  The  ease  of  mixing  previously 

constructed  index  numbers  and  specially  constructed  index  numbers  plus  the 
advantage  of  using  contractor  cost  data  for  weighting  factors  make  it  a 
practical  method  for  the  analyst  to  master. 


Table  9-1 
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CONTRACTOR  ACCOLNTING  DATA 


■  - -  '  ' 

Year 

1967 

1969 

1975 

Mo/  Indirect  Employees 

2000 

2100 

2150 

Overhead  Accounts 

1- Paid  Absences 

2 - Employee  Insurance 

3- Savings-Retirement 

4- Education 

$1,262,000 

552,000 

810,000 

18,000 

$1,398,600 

598,500 

894,600 

21,000 

$1,388,900 

604,150 

991,500 

21,500 

Total  Employee  Benefits 

$2,642,000 

$2,912,100 

$3,005,700 

First,  calculate  prices  from  cost  and  volume  data: 


p  o  Amo  tine  paid  in  the  year  for  service 

*ij  Quantity  of  service  bought  during  the  year 

e.q. ,  Price  of  Paid  absences  in  1967  ■  f  ~2 '00° -  «  $6 31/person 

2000  person 

Table  9-2 


Table  9-3 


INDEX  NUMBERS 

(Base  1967)  100  x  Price  Relative 


Year  1967  1969  1975 


(overhead  Accounts 


1 

100.0 

105.5 

102.3 

2 

100.0 

103.3 

101.8 

3 

100.0 

105.2 

113.8 

4 

100.0 

111.1 

111.1 

Third,  calculate  the  relative  weights  of  the  benefits  in  each  category 
for  each  year: 
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e.g..  Calculate  the  relative  weights  of  paid  absences  to  total  employee 
benefits  for  the  year  1967: 


$1,262 ,000 
$2,642,000 


Table  9-4 


RELATIVE  WEIGHT  OF  BENEFITS 


i  -  i 

Year 

1967 

1969 

1975 

Overhead  Accounts 

1 

0.478 

0.480 

0.462 

2 

0.209 

0.206 

0.201 

3 

0.306 

0.307 

0.330 

4 

0.007 

0.007 

0.007 

1.000 

1.000 

1.000 

Fourth,  calculate  index  numbers  for  each  year  by  summing  the  products  of 
each  account  index  number  and  its  relative  weight: 
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e.g.,  I  -  100  [(105.5  x  0.480)  +  (103.3  x  0.206)  + 

1968  (105.2  x  0.307)  +  (111.1  x  0.007)]  -  105.0 
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CHAPTER  10 


TIME  SERIES  FORECASTING  TECHNIQUES 

This  section  introduces  you  to  a  limited  number  of  basic,  but  neverthe¬ 
less  powerful,  forecasting  techniques  for  analyzing  the  trend  in  time 
series  data. 

The  three  basic  groups  of  techniques  for  analyzing  trend  are :  ( 1 )  line 
fitting  techniques,  (2)  moving  averages,  and  (3)  exponential  averages.  All 
three  of  these  groups  of  techniques  are  naive  because  they  attempt  to  pro¬ 
ject  the  past  into  the  future  with  minimal  attention  paid  to  actual  cause 
and  effect  relationships.  They  are  unsophisticated  in  that  there  are  more 
powerful  (and  more  complex)  techniques  available  for  analyzing  time  series 
data.  Furthermore,  this  section  does  not  pretend  bo  cover  even  these  naive 
and  unsophisticated  techniques  in  great  depth.  The  use  of  line  fitting 
techniques,  moving  averages,  and  exponential  averages  to  analyze  time 
series  data  goes  well  beyond  the  applications  and  examples  presented  here. 
Nevertheless,  the  techniques  discussed  are  useful  and  relatively  powerful 
in  spite  of,  or  perhaps  because  of,  their  simplicity.  These  techniques 
have  been  applied  in  the  "real"  world  and  have  proven  adequate  enough  to 
insure  their  continued  use. 

The  techniques  themselves  are  mathematically  mechanical  operations. 

Fed  a  certain  amount  of  data,  they  will  produce  a  number.  It  would  be 
improper  at  this  point,  however,  to  label  that  number  a  forecast.  All 
these  forecasting  techniques  can  do  is  process  data  into  information. 
Forecasters  forecast.  Not  until  the  forecaster  applies  his  own  knowledge, 
judgment,  and  good  common  sense  to  the  problem  can  this  information  be 
developed  into  a  forecast. 

What  are  the  characteristics  of  a  good  forecaster?  First,  he  must  have 
a  good  working  knowledge  of  the  tools  of  his  trade.  He  must  know  the 
characteristics,  limitations,  strengths,  and  weaknesses  of  the  analytical 
techniques  available  to  him  so  that  he  can: 

(1)  Select  the  most  appropriate  technique  for  the  situation  at  hand. 

(2)  Properly  apply  the  technique. 

(3)  Interpret  the  information  generated  by  application  of  the  technique. 

Second,  he  must  be  knowledgeable  about  the  particular  event  or  variable 
he  is  trying  to  forecast.  The  future  does  not  just  happen;  it  is  the 
result  of  actions  that  have  occurred  in  the  past  and/or  are  occurring  in 
the  present.  If  the  forecaster  attempts  to  forecast  the  future  without 
having  any  conception  of  the  causes  underlying  the  past  behavior  of  his 
data,  he  is  trusting  to  blind  luck. 

Third,  the  forecaster  must  have  good  judgment  and  common  sense.  In 
spite  of  all  the  data  and  all  the  sophisticated  techniques  available  to  him. 


in  the  final  analysis,  the  forecast  must  be  the  product  of  his  own 
thinking.  If  this  were  not  so,  forecasters  could  be  replaced  by 
bookkeepers  and  computer  operators.  Unfortunately,  there  is  no  such 
creature  as  a  totally  objective  forecast.  From  the  beginning  to  the  end  of 
the  analysis,  the  forecaster  must  make  decisions  that  influence  the  final 
outcome  of  his  endeavors.  If  the  results  of  his  analysis  are  contrary  to 
what  the  forecaster's  experience  and  intuitive  logic  have  led  him  to 
expect,  then  he  must  reconcile  the  differences.  The  forecaster  must  be 
certain  that  his  forecast  makes  sense,  that  it  is  plausible,  before  he  pre¬ 
sents  it  to  the  decision  maker. 


Before  we  proceed  any  further  in  our  discussion  of  forecasting,  it  is 
necessary  to  introduce  some  symbology.  Throughout  the  text  the  symbology 
will  consist  of  a  combination  of  capital  letters  and  numbers  followed  by  a 
set  of  parenthesis  containing  small  letters  and  numbers.  The  capital  let¬ 
ters  and  numbers  will  identify  the  particular  subject.  The  small  letters 
and  numbers  contained  in  the  parenthesis  will  be  parameters  that  describe 
the  subject.  I  realize  that  this  is  not  clear,  but  a  few  examples  should 
clarify  what  I  am  saying. 


Lets  Y 
F 
t 
i 

n 


the  variable  we  are  going  to  forecast 
our  forecast  of  the  variable  Y 

the  time  period  from  which  we  are  making  our  forecast 
the  number  of  periods  out  from  time  period  t  that  we  are 
trying  to  forecast  (forecast  interval) 
the  number  of  observations  used  to  make  our  forecast 


Therefore:  Y(t)  »  the  value  of  variable  Y  in  time  period  t 

Y(t+i)  *  the  value  of  variable  Y  in  time  period  t+i 
F(t,t+i)  -  our  forecast  of  variable  Y  made  in  time  period  t 
for  time  period  t+i  ' 


The  capital  letters  Y  and  F  tell  us  whether  we  are  talking  about  the 
actual  value  of  a  variable  or  the  forecasted  value  of  a  variable,  and  the 
small  letters  t  and  i  are  used  to  describe  the  particular  Y  or  F  we  are 
talking  about. 

Since  I  am  using  parenthesis  to  contain  descriptive  information,  I  can¬ 
not  use  parenthesis  for  multiplication.  In  other  words,  Y(t)  does  not  mean 
Y  "times"  t.  Therefore,  whenever  I  want  to  indicate  that  multiplication  is 
to  take  place,  I  will  use  an  asterisk  "*".  Therefore,  Y(t+i)  will  mean  the 
value  of  variable  Y  in  time  period  t+i,  and  Y( t+i)* (t+i)  means  the  value  of 
variable  Y  in  time  period  t+i  times  the  value  (t+i). 

Using  our  new  symbols,  a  forecasting  exercise  would  go  like  this.  We 
are  in  time  period  t.  We  know  the  value  of  our  variable  Y  in  the  current 
time  period;  it  is  Y(t).  We  want  to  forecast  the  value  of  variable  Y  in 
some  future  time  period,  t+i.  Our  forecast,  made  in  time  period  t,  is 
F(t,t+i). 
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Next,  we  need  to  ask  the  question,  what  are  the  characteristics  of  a 
good  forecast.  One  important  characteristic  is  that  the  forecast  be 
accurate.  It  would  be  the  best  of  all  possible  worlds  if  the  future  turned 
out  to  be  a  perfect  mirror  image  of  our  forecast.  Using  our  symbology, 
perfect  accuracy  would  exist  when  our  forecast  of  variable  Y  made  in  time 
period  t  for  time  period  t+i  equaled  the  actual  value  of  Y  in  time  period 
t+i,  or  F(t,t+i)  *  Y( t+i) .  It  is  apparent  from  the  equation  that  we  cannot 
measure  the  accuracy  of  our  forecast  until  we  reach  the  time  period  we  are 
trying  to  forecast.  If  our  forecast  was  good,  we  can  pat  ourselves  on  the 
back  and  take  our  bows.  If  the  forecast  was  bad,  we  can  hide  out  until  the 
repercussions  of  our  error  have  dimmed  in  the  boss's  mind.  The  one  thing 
we  can't  do  is  rectify  the  situation.  Just  like  in  football,  no  amount  of 
Monday  morning  quarterbacking  will  change  the  final  score. 

Unfortunately,  it  is  the  forecaster's  plight  that  the  probability  of 
F(t,t+i)  =  Y(t+i)  is  very  close  to  zero.  When  it  comes  to  absolute  fore¬ 
casts  of  the  future,  forecasters  are  almost  always  wrong.  The  goal  in 
forecasting  is  not  perfection.  The  perfect  forecast  is  yet  to  be  made. 

The  goal  in  forecasting  is  to  be  able  to  predict  the  future  with  enough 
accuracy  that  the  errors  in  your  forecast  are  both  consistent  and  small. 

We  can  measure  the  error  in  our  forecast  by  defining  a  new  symbol,  E, 
which  stands  for  forecasting  error. 

E(F: t ,t+i)  -  Y(t+i)  -  F(t,t+i)  (Equation  10-1) 

The  error  in  our  forecast  is  equal  to  the  difference  between  what  actually 
occurred,  Y(t+i),  and  what  we  forecasted  would  occur,  F(t,t+i). 

For  example,  let  us  suppose  that  the  following  table  represents  our 
record  of  forecasts  made  in  the  past  using  a  forecasting  technique  which  vie 
will  refer  to  as  FI. 


Table  10-1 


’(t+i) 

FI (t,t+i) 

E (FI : t,t+i) 

293 

309 

-16 

312 

322 

-10 

347 

334 

13 

369 

387 

-18 

384 

367 

17 

420 

406 

14 

441 

454 

-13 

468 

451 

17 

495 

482 

13 

516 

533 

-17 

The  question  we  must  now  ask  is  how  do  we  measure  the  accuracy  of  our 
forecast.  To  answer  this  question,  we  must  examine  the  error  term,  E(F1) 
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One  possible  statistic  we  could  use  to  describe  the  error  term  would  be 
the  range,  i.e.,  the  highest  and  lowest  values  for  E(F1 ) .  In  our  example , 
the  range  of  our  error  term  is  from  >18  to  +17.  The  smaller  the  range  of 
values  for  the  error  terms,  the  more  accurate  our  forecast.  Suppose 
someone  made  a  forecast  and  told  you  the  range  of  their  past  forecasting 
errors  was  -$9,000,000  to  +$8,000,000.  What  would  your  conclusion  be 
regarding  the  accuracy  of  their  forecasting  technique?  The  question  cannot 
be  answered  because  you  have  no  knowledge  regarding  the  value  of  the 
variable  being  forecasted.  If  they  were  forecasting  a  value  of  $20,000,000 
the  range  of  their  error  term  suggests  a  very  inaccurate  forecast.  If 
their  forecasted  value  was  $2  billion,  the  range  of  the  error  term  seems 
very  small  by  comparison.  It  would  seem  worthwhile,  therefore,  to  compare 
the  range  of  our  error  terms  to  the  values  of  the  variable  being  forecasted. 

One  method  would  be  to  express  the  range  of  the  error  term  as  percen¬ 
tages  of  the  mean  of  Y(t),  or  Y(t).  In  our  example  from  Table  10-1,  Y(t)  * 
4045/10  *  404.5.  Therefore,  the  range  of  our_error  term  expressed  as  a 
percent  of  Y(t)  would  be  -4.45%  to  +4.20%  of  Y(t).  This  method  will  give 
you  a  fairly  consistent_measure  of  the  magnitude  of  the  error  term  when_ 

Y(t)  fluctuates  around  Y(t)  over  time  and  the  variation  in  E(t)  around  E(t) 
is  relatively  constant.  A  second  method  would  be  to  convert  each  error 
term  to  a  percent  of  the  observation  being  forecasted  and  express  the  range 
of  the  error  term  as  the  smallest  and  largest  percentage  error.  In  our 
example  from  Table  10-1,  the  smallest  error  term  is  -5.46%  and  the  largest 
error  term  is  +4.43%.  These  error  terms  correspond  to  Y(t+i)  •  293  and 
Y(t+i)  ■  384,  respectively.  Therefore,  using  this  method,  we  would  express 
the  range  in  our  error  term  as  -5.48%  to  +4.43%  of  Y(t).  This  method  will 
give  you  a  fairly  consistent  measure  of  the  magnitude  of  the  error  term 
when  Y(t)  is  increasing  (or  decreasing)  over  time  and  the  error  term  is 
increasing  (or  decreasing)  over  time  at  the  same  relative  rate. 


Another  possible  statistic  would  be  the  average  value  of  the  error 
terms  E(F1)  ■  !E(F1)/n,  where  n  equates  to  the  number  of  error  terms. 
However,  when  we  add  up  the  error  terms  in  our  example,  E(E(F1)  -  0  and 
E(F1)  -  0.  We  can  see  from  our  data  that  the  negative  errors  and  the  posi¬ 
tive  errors  are  cancelling  each  other  out.  This  could  happen  whether  the 
error  terms  are  large  or  small.  Consequently,  E(F1)  does  not  seem  to  convey 
any  significant  information  about  the  characteristics  of  the  error  team. 

One  method  for  eliminating  the  effect  of  negative  and  positive  signs  is  to 
examine  the  squared  error  terms.  We  can  find  the  sum  of  the  squared  error 
terms  and  use  this  number  to  calculate  a  statistic  that  measured  the 
dispersion  of  our  forecasts  around  the  actual  observed  values.  In  our 
example,  Ee2(F1)“2250.  Dividing  this  sum  by  the  number  of  error  terms  and 
we  get  lE2(F1)/n  -  2250/10  ■  225.  This  is  the  average  of  our  squared  error 
terms .  Therefore,  we  must  take  the  square  root,  yi E* (FI )/n  “  ^2250/ 10  - 

/225  -  15.  This  number  we  will  call  the  standard  error  of  the  forecast. 


S  (F) 


yi 


IE2 (F:t,t+i) 
n 


(Equation  10-2) 


10-4 


« 


«r 


S(F)  measures  the  dispersion  of  our  forecasts  around  the  actual  values  of  Y. 
The  smaller  the  standard  error  of  the  forecast,  the  more  accurate  our  fore¬ 
cast. 


Another  method  of  counteracting  the  cancelling  effect  of  positive  and 
negative  signs  is  to  take  the  average  of  the  absolute  values  of  the  error 
term.  When  you  take  the  absolute  value  of  a  number,  you  treat  the  number  as 
though  it  were  positive  regardless  of  its  original  sign.  Going  back  to  our 
example  Z [e(F1 ) |  “  148.  The  average  of  the  absolute  values  would  be 

Z|E(F1)|/n  ”  148/10  ”  14.8.  This  number  is  used  frequently  to  measure  the 
accuracy  of  forecasts,  and  is  called  the  mean  absolute  deviation  or  MAD, 
where 

MAD (F)  =  s|E(F;t,t+i) |  (Equation  10-3) 


The  MAD(F)  is  always  smaller  than  the  standard  error  of  the  forecast,  and 
the  smaller  the  MAD(F)  the  more  accurate  our  forecast. 

In  statistics,  the  standard  error  (or  standard  deviation)  was  the 
measure  of  dispersion  most  used  to  describe  the  behavior  of  a  data  set. 

This  was  true  because  in  statistics  we  dealt  with  probabilities,  and  our 
probabilities  were  determined  by  the  standard  error  of  our  particular  pro¬ 
bability  distribution. 

In  time  series  forecasting,  however,  we  cannot  do  probabilistic  fore¬ 
casting  (within  the  limits  of  this  discussion).  Our  measure  of  dispersion 
is  used  to  give  us  an  indication  of  the  expected  accuracy  of  our  forecast. 
Therefore,  we  should  focus  our  attention  on  the  measure  of  dispersion  that 
has  the  greatest  information  value,  i.e.,  we  can  understand  what  it  is 
telling  us. 

In  my  opinion,  the  concept  underlying  the  standard  error  is  a  nebulous 
concept  at  best.  Just  what  does  the  average  of  the  sum  of  the  squared 
deviations  of  the  error  term  around  its  mean  measure?  Does  the  concept 
become  any  more  clear  when  you  take  the  square  root?  Stripped  of  its  pro¬ 
babilistic  characteristics,  the  standard  error  as  a  measure  of  dispersion 
is  very  low  in  information  content.  It  does  not  speak  to  me. 

The  MAD(F),  by  comparison,  is  informative.  The  fact  that  the  MAD(F) 
cannot  be  associated  with  a  particular  probability  distribution  is  irrele¬ 
vant  in  time  series  forecasting.  The  MAD(F)  does  speak  to  me.  It  tells 
me,  on  the  average,  how  much  my  forecasts  have  varied  from  the  observations 
they  were  trying  to  predict. 

Our  three  measures  of  forecasting  accuracy,  the  range,  the  standard 
error  of  the  forecast,  and  the  MAD(F)  do  not  always  point  to  the  same  fore¬ 
casting  technique  as  being  the  most  accurate.  For  example,  suppose  we  had 
a  second  forecasting  technique  labeled  F2,  which  provided  the  following 
results . 
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Table  10-2 


Y(t+i) 

F2(t,t+i) 

E (F2 : t , t+i ) 

293 

278 

15 

312 

320 

-8 

347 

321 

26 

369 

360 

9 

384 

392 

-8 

420 

441 

-21 

441 

432 

9 

468 

493 

-25 

495 

503 

-8 

516 

504 

12 

The 

range  for  E(F2)  goes 

from  the  -25  to  +26.  The  £B(F2)  ®  1, 

so  E(F2) 

«  .  1 . 

The  £E2(F2)  - 

2465,  so 

S(F2)  «  V2465/10 

-^246. 5  =  15.7. 

The 

£  |E(F2) 

j  =  141,  so 

MAD(F2)  - 

141/10  -  14.1. 

To 

evaluate  F(2) 

against 

F( 1 )  we  must  compare  their  respective 

error 

terms • 

We  do  this  be  comparing  the  statistics 

which  describe  the 

behavior 

of  the 

error  terms • 

Table  10-3 

V 

Statistics  FI 

F2 

Range 

-18  to  17 

-25  to  26 

S(F) 

15.0 

15.7 

MAD(F) 

14.8 

14.1 

The  range  of  values  for  the  error  tern  is  smaller  for  fi  than  F2  and 
so  is  the  standard  error  of  the  forecast.  F2r  however,  has  a  smaller  mean 
absolute  deviation.  Since  our  previous  discussion  lead  us  to  the  conclu¬ 
sion  that  the  MAD(F)  is  a  more  informative  measure  of  forecasting  accuracy, 
why  not  ignore  the  standard  error  of  the  forecast  altogether?  The  answer 
is  that  there  is  a  relationship  between  S(F)  and  the  MAD(F)  that  reveals 
information  about  the  behavior  of  the  error  term  which  is  not  communicated 
by  the  MAD(F)  alone.  The  closer  the  absolute  value  of  the  error  terms  are 
to  the  MAD(F)  the  closer  the  S(F)  will  be  to  the  MAO(F) .  Conversely,  the 
further  away  the  absolute  value  of  the  error  terms  are  to  the  MAD(F),  the 
further  away  S(F)  will  be  from  the  MAD(F).  In  our  example  from  Table  10-1, 
the  absolute  values  of  the  error  terms  fluctuated  around  the  MAD(FI)  with 
little  deviation.  Five  of  the  ten  error  terms  are  within  the  range  of  the 
MAD(FI)  +  3.  For  FI,  the  S(F1)  and  MAD(FI)  are  reasonably  close  together, 

15  versus  14.8.  In  our  example  from  Table  10-2,  the  absolute  values  of  the 
error  terms  fluctuate  widely  around  the  MAD(F2).  Only  one  of  the  ten  error 
terms  are  within  the  range  of  MAD(F2)  +  1,  and  only  two  error  terms  fall 
within  the  range  of  the  MAD(F2)  +  2.  You  have  to  extend  the  range  to  the 
MAD(F2)  +  7  to  pick  up  eight  of  the  ten  error  terms.  For  F2,  the  S(F1)  and 
MAD(F2)  iure  further  apart,  15.7  versus  14.1.  The  S(F)  and  the  MkO(F)  are 
equal  to  each  other  only  when  the  absolute  value  of  the  error  term  is  a 
constant  for  every  observation.  The  greater  the  variability  in  [l(F)| 
around  the  MAD(F)  the  greater  the  difference  between  the  S(F)  and  the  MAD(F) . 
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Therefore,  we  can  expect  that  when  the  absolute  values  of  the  error 
term  fall  within  a  small  range,  the  S(F)  and  the  KAI>(F)  will  be  close 
together*  Also,  if  S(F)  and  the  MAO(F)  are  close  together,  we  know  that 
the  forecasting  technique  under  investigation  produces  error  terms  that  are 
of  a  consistent  size.  We  can  also  expect  that  when  the  absolute  value  of 
the  error  terms  occupies  a  large  range,  S(F)  and  the  MAO(F)  will  be  further 
apart.  Likewise,  when  S(F)  and  the  MAD(F)  are  far  apart,  we  know  that  the 
forecasting  technique  under  investigation  produces  error  terms  that  vary 
significantly  in  size. 

The  forecasting  error  for  FI  is  more  consistent  from  period  to  period. 
You  can  expect  your  forecasts  to  be  within  plus  or  minus  IS,  give  or  take 
one  or  two.  Relying  on  F2,  however,  you  cannot  predict  whether  your  next 
forecast  will  be  right  to  within  plus  or  minus  25.  Therefore,  risk  of 
error  becomes  more  difficult  to  manage  with  F2  than  with  FI.  However,  F2 
gave  more  accurate  forecasts  in  seven  out  of  ten  times,  while  FI  was  more 
accurate  only  three  times  out  of  ten  times.  So,  using  accuracy  as  our  cri¬ 
teria,  which  is  the  better  forecasting  technique?  It  appears  that  there  is 
no  one  unequivocal  answer  to  this  question.  We  must  fall  back  on  the  old 
adage,  it  depends  on  the  situation.  If  your  situation  is  such  that  you 
cannot  tolerate  large  forecasting  errors  because  the  cost  of  missing  a 
forecast  by  a  large  amount  is  unacceptable ,  you  would  select  FI  for  your 
forecasts.  If  you  are  in  a  situation  where  you  must  reserve  significant 
amounts  of  scarce  resources  to  protect  yourself  against  the  cost  of  large 
forecasting  errors,  FI  would  require  a  smaller  reserve.  If  you  are  in  a 
situation  where  the  benefits  derived  from  accurate  forecasts  exceed  the 
costs  of  large  forecasting  errors,  and  you  can  afford  the  cost  of  an  occa¬ 
sional  forecasting  error  of  large  proportions,  you  would  choose  F2.  If 
your  success  depends  upon  being  right  more  often  than  you  are  wrong,  you 
may  be  forced  to  choose  F2  in  spite  at  the  risk  of  a  large  error. 

As  you  can  see,  the  selection  of  an  appropriate  forecasting  technique 
depends  upon  three  things  in  addition  to  the  measured  accuracy  of  the  fore¬ 
casting  technique  itself :  ( 1 )  the  cost  to  you  of  making  a  forecast  that 

turns  out  to  be  very  wrong,  (2)  the  benefits  derived  from  making  a  forecast 
that  turns  out  to  be  very  accurate,  and  (3)  your  ability  to  tolerate  risk. 

Another  characteristic  we  would  like  our  forecast  to  have  is  that  it  be 
responsive  to  changes  in  the  environment.  Almost  all  forecasts  are  extra¬ 
polations  of  the  past  into  the  future,  from  the  known  to  the  unknown. 
However,  the  past  does  not  necessarily  repeat  itself  in  duplicate.  Events 
change,  and  these  changes  will  only  become  evident  as  they  appear  in  our 
data  base.  Therefore,  a  forecasting  technique  would  be  responsive  to 
changing  conditions  only  if  it  was  very  sensitive  to  changing  values 
reflected  in  the  most  recent  observations. 

However,  there  is  a  tradeoff.  Events  seldom  occur  in  nice  regular  pat¬ 
terns.  The  variable  we  are  trying  to  forecast  is  subject  to  random  fluc¬ 
tuations,  sometimes  up  and  sometimes  down,  because  of  the  combined  effect 
of  a  multitude  of  causation  variables  that  are  individually  trivial  in 


their  impact.  These  random  fluctuations  will  overlay  the  primary  cause  and 
effect  relationships  determining  our  forecasted  variable.  We  do  not  want 
our  forecast  to  change  because  of  random  fluctuations  when  the  primary 
cause  and  effect  relationships  are  stable.  We  do  want  our  forecasts  to 
respond  to  changes  in  the  primary  cause  and  effect  relationships.  However, 
if  we  design  our  forecasting  technique  to  be  sensitive  to  changes  in  the 
most  recent  data  in  order  to  accomplish  our  latter  objective,  our  forecasts 
will  also  respond  to  random  fluctuations  as  we  collect  new  data.  If  we 
desensitize  our  forecasting  technique  to  random  fluctuations  we  lose  the 
ability  to  detect  changing  conditions.  How  do  you  accomplish  both 
objectives?  The  answer  is,  you  don't. 

As  a  forecaster  you  are  confronted  with  the  eternal  dilemma  of  choosing 
between  responsive  and  stable  forecasts.  Your  choice  will  depend  on  the 
particular  situation.  You  will  want  to  use  a  forecasting  technique  that  is 
responsive  to  changing  conditions  even  at  the  expense  of  distortion  caused 
by  random  fluctuation  whenever :  ( 1 )  the  probability  that  conditions  will 

change  is  high,  (2)  the  cost  of  making  a  decision  based  on  an  erroneous 
forecast  is  high,  (3)  the  benefit  of  being  able  to  respond  quickly  to 
changed  conditions  is  high,  (4)  the  data  is  subject  to  relatively  small 
random  f luctuations ,  and  (5)  the  cost  of  making  decisions  based  on  fore¬ 
casts  distorted  by  random  fluctuations  is  low. 

You  will  want  to  use  a  forecasting  technique  that  produces  relatively 
stable  forecasts  whenever :  (1)  the  data  is  subject  to  large  random  fluc¬ 

tuations,  (2)  the  cost  of  making  decisions  based  on  forecasts  distorted  by 
random  fluctuations  is  high,  (3)  the  probability  that  conditions  will 
change  is  low,  (4)  the  cost  of  making  a  decision  based  on  erroneous  fore¬ 
casts  is  low,  and  (5)  the  benefit  of  being  able  to  respond  quickly  to 
changed  conditions  is  low.  It  is  unfortunate  that  the  stipulations  men¬ 
tioned  above  usually  do  not  occur  simultaneously  in  a  way  that  makes  the 
choice  between  stability  and  responsiveness  an  obvious  one.  There  is  never 
one  right  answer.  You  can  only  make  reasonable  choices  based  on  judgment 
and  common  sense  and  then  learn  to  live  with  the  risk  inherent  in  an  uncer¬ 
tain  future. 

A  third  desired  characteristic  of  a  forecast  is  that  it  be  reliable. 
Every  forecasting  technique  is  based  upon  its  own  set  of  assumptions  which 
under ly  and,  to  a  certain  extent,  determine  the  forecast.  Reliability,  as 
a  forecast  characteristic,  depends  upon  the  continued  validity  of  these 
assumptions  in  the  future.  The  assumption  underlying  the  forecast  must 
remain  valid  over  time  from  the  present  to  the  period  being  forecasted 
before  the  forecast  can  be  considered  reliable.  As  the  possibility 
increases  that  the  assumptions  upon  which  the  forecast  is  based  are  losing 
their  validity,  the  forecast  itself  becomes  more  and  more  unreliable . 

The  purpose  of  a  forecast  is  to  provide  information  for  the  decision 
making  process.  Therefore,  the  last  characteristic  we  want  our  forecasting 
technique  to  possess  is  a  favorable  ratio  of  cost  to  benefits.  The  cost  of 
making  a  forecast  can  be  measured  in  three  ways,  money  cost,  time  cost,  and 
psychological  cost. 
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The  techniques  discussed  in  this  text  are  relatively  cheap  to  make  in 
terms  of  dollar  cost.  The  greatest  expense  is  probably  occurred  in  the 
performance  of  the  data  gathering  function.  Once  data  is  placed  in  a  file 
accessible  to  the  computer,  programs  written  for  the  types  of  techniques 
with  which  we  are  concerned  cost  very  little  money  to  run.  It  is  unlikely 
that  cost  would  be  a  determining  factor  in  your  selection  unless  you  move 
on  to  more  sophisticated  forecasting  techniques. 


Time  may  become  a  critical  element  in  selecting  a  forecasting  tech¬ 
nique,  even  at  our  naive  level.  If  a  decision  must  be  made  tomorrow,  a 
forecasting  technique  that  will  not  produce  a  usable  forecast  in  the  amount 
of  time  available  is  worthless.  Therefore,  time  constraints  may  restrict 
the  choices  you  can  make  to  respond  to  a  particular  problem. 


Psychological  costs  are  more  nebulous  and  difficult  to  define.  What  is 
the  cost  to  a  decision  maker  when  he  is  faced  with  making  a  decision  based 
on  information  produced  by  a  forecasting  technique  he  does  not  understand 
and  therefore,  has  little  confidence  in?  A  forecasting  technique  that  can 
be  understood  by  the  decision  maker  because  it  falls  within  his  realm  of 
competence  may  be  a  better  choice  over  a  sophisticated  but  complicated 
forecasting  technique  completely  beyond  his  comprehension.  The  role  of  the 
forecaster  is  to  aid  and  assist  the  decision  maker,  not  to  just  make  fore¬ 
casts.  Consequently,  the  acceptance  by  the  decision  maker  of  the 
forecaster's  work  may  be  as  critical  as  the  forecast  itself  if  the  forecast 
is  to  be  of  any  real  value. 


Good  forecasts  enhance  the  decision  making  process  in  two  ways.  First, 
a  good  forecast  may  provide  some  knowledge  about  a  future  that  was  hereto¬ 
fore  unknown.  This  expands  the  information  base  upon  which  the  decision 
maker  must  make  his  decision.  By  reducing  the  unknowns  in  the  decision 
making  process,  the  decision  maker  may  be  able  to  make  more  informed, 
objective,  and  therefore  more  reasonable  decisions. 


Second,  a  good  forecast  may  confirm  or  support  previously  known  infor¬ 
mation  or  reinforce  intuitive  knowledge.  The  forecast  may  not  result  in  a 
different  decision,  but  it  will  reduce  the  risk  or  uncertainty  inherent  in 
making  the  decision.  The  benefits  are  primarily  psychological.  Being  able 
to  make  decisions  in  an  environment  of  reduced  risk  or  uncertainty  alle¬ 
viates  much  of  the  pressure  (and  fear)  of  both  making  and  having  to  live 
with  the  consequences  of  a  bad  decision. 


In  conclusion,  we  want  a  forecasting  technique  that  will  deliver 
accurate  forecasts,  be  responsive  to  changing  conditions  while  immune  to 
the  effects  of  random  fluctuations,  and  will  deliver  benefits  to  the  deci¬ 
sion  maker  in  excess  of  its  costs. 


Anytime  we  have  a  variable  for  which  we  have  measurements  over  time/ 
time  series  analysis  can  be  applied  provided  that  two  essential  assumptions 
are  found  to  be  valid.  Time  series  analysis  attempts  to  define  the  path  of 
a  particular  variable  over  time,  a  path  that  will  be  defined  by  some  mathe¬ 
matical  relationships  which  says  Y(t)  -  f(t). 

The  mathematical  symbol  "f",  meaning  "function  of",  implies  a  cause  and 
effect  relationship.  To  say  that  Y(t)  is  a  function  of  time  implies  that 
time  causes  Y(t)  to  occur.  For  almost  all  time  series  data  this  is 
definitely  not  true.  The  variable  Y(t)  is  almost  never  caused  by  time. 
However,  if  you  believe  that  every  effect  has  a  cause,  Y(t)  must  be  caused 
by  something.  Furthermore,  whatever  is  causing  Y(t)  to  occur  is  not  being 
accounted  for  in  our  analysis. 

If  you  are  doing  an  analysis  and  you  want  to  omit  a  variable,  there  are 
two  ways  to  do  it.  One  method  is  to  assume  that  the  effect  this  one 
variable  will  have  on  the  final  outcome  of  your  analysis  is  trivial  or 
negligible,  and  therefore,  adding  the  variable  to  your  analysis  only 
complicates  matters  without  necessarily  improving  on  the  final  product.  In 
the  case  of  time  series  analysis,  we  are  talking  about  the  actual  causation 
variables,  so  this  argument  will  not  perform  the  deed  for  us. 

The  second  method  for  omitting  variables  from  an  analysis  is  to  assume 
(1)  that  the  variable  you  wish  to  omit  is  not  changing,  and  (2)  that  the 
effect  this  variable  is  having  on  your  analysis  is  also  unchanging.  Both 
conditions  are  necessary.  For  example,  suppose  you  are  trying  to  forecast 
a  variable  Q  which  is  caused  by  variable  R  along  with  other  variables.  If 
the  value  of  R  remains  constant,  and  if  the  effect  R  has  on  Q  also  remains 
constant,  you  can  omit  R  from  your  analysis. 

It  is  this  second  method  which  we  fall  back  on  in  time  series  analysis. 
However ,  we  are  looking  at  changes  in  Y(t)  over  time.  If  all  of  the  causa¬ 
tion  variables  do  not  change  and  the  effect  these  variables  are  having  on 
Y(t)  do  not  change,  Y(t)  itself  will  not  change.  We  know  that  Y(t)  changes 
over  time,  so  tie  must  modify  our  assumption  slightly.  If  the  variables 
which  are  causing  Y(t)  to  change  continue  to  move  along  the  same  path  in 
the  future  as  they  have  in  the  past,  and  they  continue  to  exert  their 
effects  on  Y(t)  with  the  same  magnitude  of  force  and  direction  as  they  have 
in  the  past,  the  causation  variables  can  be  omitted  from  our  analysis. 

Only  when  both  of  these  conditions  are  present  can  we  base  our  forecast  of 
Y(t)  on  the  variable's  past  history  alone. 

An  obvious,  but  critical,  conclusion  should  come  leaping  into  your  mind 
at  this  point.  YOU  CANNOT  USE  TIME  SERIES  ANALYSIS  TO  FORECAST  A  VARIABLE 
WHEN  YOU  KNOW  NOTHING  ABOUT  THE  FORCES  THAT  ARE  ACTUALLY  CAUSING  THAT 
VARIABLE  TO  CHANGE  OVER  TIME.  The  validity  of  the  above  two  assumptions 
about  the  behavior  of  the  causation  variables  is  essential  for  a  valid  time 
series  forecast.  If  either  one  of  the  two  assumptions  are  violated,  the 
time  series  forecasting  techniques  discussed  in  this  course  are  not  appli¬ 
cable  . 


10-10 


It  must  be  equally  obvious  that  to  know  whether  or  not  the  assumptions 
are  valid,  you  must  know  something  about  the  causation  variables.  As  a 
minimum,  you  must  be  able  to  identify  the  major  causation  variables,  and 
determine  whether  or  not  there  is  any  basis  for  concluding  that  ( 1 )  they 
are  not  going  to  continue  to  move  along  their  historical  paths,  and  (2) 
they  are  not  going  to  continue  to  exert  their  influence  through  historical 
cause  and  effect  relationships. 

Generally,  it  is  not  unrealistic  to  accept  the  validity  of  the  two 
basic  assumptions  underlying  time  series  forecasting  unless  there  is  evi¬ 
dence  to  the  contrary.  If  you  cannot  find  any  plausible  reason  for 
believing  that  one  or  both  of  the  assumptions  are  not  valid,  proceed  with 
your  forecast.  However,  if  you  want  to  have  any  confidence  in  your  fore¬ 
cast,  your  investigation  of  the  causation  variables  must  be  a  serious  one. 

If  the  assumptions  are  not  valid,  you  can  have  no  more  faith  in  your  fore¬ 
cast  that  you  could  have  in  a  pure  guess.  The  mathematical  sophistication 
of  the  forecasting  technique  itself  will  be  only  illusionary;  it  will  add 
nothing  to  the  quality  of  your  prediction. 

It  is  worthwhile  to  note  that  the  further  out  into  the  future  we  pro¬ 
ject  the  forecast,  the  more  likely  it  will  be  that  at  least  one  of  our 
assumptions  will  not  hold  true.  If  either  one,  or  both,  of  the  assumptions 
is  not  valid,  the  further  into  the  future  we  try  to  predict,  the  more  error 
we  are  likely  to  have  in  our  forecast.  Consequently,  under  most  cir¬ 
cumstances,  forecasts  based  on  time  series  analysis  are  limited  in  their 
ability  to  see  very  far  into  the  future  with  any  degree  of  reliability. 

The  techniques  discussed  in  this  text  are  good  for  making  short  range  fore¬ 
casts,  but  their  value  for  long  range  forecasts  is  limited  by  their 
underlying  assumptions. 

For  example,  suppose  we  had  plotted  wage  rates  against  time  (Figure  10-1) 
Our  historical  data  goes  from  1970  through  1977. 


Figure  10-1 


11.00  p  WAGE 
RATE 

10.00  - 


forecast 

ERROR 


1971  1972  1973  1974  1975  1976  1977  1978  1979  1980  1981 


I 


If  we  were  to  make  a  forecast  from  this  data,  we  could  easily  assume 
from  the  plot-points  alone  that  the  wage  rate  is  increasing  along  the 
straight  line  path  AB.  However,  suppose  that  in  1975,  the  cost  of  living 
started  to  increase  at  a  faster  rate  than  it  had  in  the  past*  To  keep  up 
with  the  cost  of  living,  workers  would  demand  higher  and  higher  wage 
increases*  In  this  case,  our  assumption  that  the  causation  variables  must 
continue  along  their  historical  path  has  been  violated.  The  cost  of  living 
is  growing  at  a  different  rate  than  before.  Therefore,  wage  rates  may 
increase  along  the  line  AC.  If  we  erroneously  conclude  that  our  assump¬ 
tions  «u:e  valid  and  forecast  accordingly,  you  can  see  that  the  error  in  our 
forecast  (the  vertical  difference  between  AB  and  AC)  increases  the  further 
out  into  the  future  we  try  to  predict.  Furthermore,  we  have  no  way  of 
measuring  the  potential  magnitude  of  our  forecasting  error  given  that  we 
make  the  wrong  assumptions .  Therefore,  the  further  out  into  the  future  you 
need  to  forecast,  the  more  thorough  your  analysis  of  the  causation 
variables  must  be. 

Let  us  suppose  that  we  are  in  a  situation  where  we  can  validly  apply 
time  series  forecasting  techniques  to  our  data.  Where  do  we  start?  In 
time  series  analysis  we  are  looking  for  the  path  along  which  we  expect 
our  variable  to  move  in  the  future.  This  path  can  be  divided  into 
four  basic  elements:  trend,  seasonal  fluctuations,  cyclical  fluctuations, 
and  random  fluctuations.  In  this  section  we  are  going  to  ignore  seasonal 
and  cyclical  fluctuations  by  assuming  that  these  elements  are  not  present 
in  our  data.  Therefore,  we  are  only  going  to  be  concerned  with  the  analy¬ 
sis  of  trend  and  the  presence  of  random  fluctuations. 

Trend  is  the  primary  direction  of  the  path  the  variable  will  follow 
over  time.  In  Figure  10-1,  the  lines  AB  and  AC  would  represent  two  dif¬ 
ferent  forecasts  of  the  trend  in  wage  rates.  There  are  seven  basic  trend 
lines  that  a  time  series  variable  can  follow. 

In  Figure  10-2,  lines  AB,  CD  and  EF  all  represent  trends  where  Y(t) 
increases  over  time.  The  differences  among  the  three  lines  reflect  the 
differences  in  the  rate  at  which  Y(t)  is  increasing.  Lines  GH,  U,  and  KL 
all  represent  trends  where  Y(t)  decreases  over  time.  Line  MN  represents  a 
situation  which  we  shall  call  zero  trend.  With  zero  trend,  Y(t)  fluctuates 
around  a  central  value  which  remains  constant  over  time. 

Determining  the  location  of  a  trend  line  requires  three  tasks.  First, 
you  must  decide  whether  there  is  an  increasing,  decreasing,  or  zero  trend. 
Second,  if  the  trend  line  is  increasing  or  decreasing,  you  must  decide 
whether  the  trend  is  linear  or  nonlinear.  Third,  you  must  fit  a  trend  line 
to  the  data  in  order  to  be  able  to  extrapolate  from  the  trend  line  into  the 
future. 

The  first  step  in  locating  the  trend  line  is  to  determine  the  number  of 
data  points  you  wish  to  include  in  your  analysis.  The  trend  line  is  a  time 
path,  and  you  want  to  find  the  trend  line  for  your  variable  Y(t)  in  time 
period  NOW.  Therefore,  you  want  to  limit  the  observations  in  your  data  set 
to  those  data  points  that  reflect  Y(t)'s  current  trend  line. 


However,  trends  do  change  over  time.  In  order  to  know  how  many  obser¬ 
vations  you  can  include  in  your  data  set  you  must  be  able  to  determine  when 
the  last  change  in  trend  occurred.  An  excellent  starting  point  would  be  to 
plot  the  data  and  take  a  look.  Suppose  your  plot  looks  like  Figure  10-3. 


1950  1955  1960  1965  1970  1975  TIME  (t) 

At  what  point  in  time  did  your  trend  line  change  it's  course?  You  can¬ 
not  get  your  answer  from  a  graphic  analysis .  The  best  a  graphic  analysis 
can  do  is  identify  those  points  in  time  where  possible  changes  in  the  trend 
line  may  have  occurred.  On  our  plot  it  appears  that  the  trend  line  may 
have  changed  course  in  1963,  and  again  in  1970. 

Having  identified  these  two  potential  points  in  time,  how  do  we  deter¬ 
mine  if,  in  fact,  a  change  in  the  trend  line  did  occur.  You  must  look  to 
the  cause  and  effect  variables  and  their  relationships  to  Y(t)  to  find  the 
answer  to  your  question.  If  the  trend  line  did  change,  it  changed  for  a 
reason.  Either  one  or  both  of  two  events  must  have  occurred.  Either  the 
cause  and  effect  variables  determining  Y(t)  shifted  from  their  historical 
paths,  or  there  has  been  a  change  in  the  historical  cause  and  effect  rela¬ 
tionships,  or  both.  In  other  words,  you  must  test  the  two  basic  assump¬ 
tions  behind  time  series  analysis  and  determine  whether  or  not  they 
remained  valid  during  those  points  in  time  where  you  suspect  a  possible 
change  in  the  trend  line  may  have  occurred. 

Suppose  your  analysis  revealed  no  basis  for  assuming  a  change  in  the 
trend  line  in  1970.  However,  there  were  changes  in  1963  that  support  the 
hypothesis  that  1963  was  the  beginning  of  a  new  trend.  You  have  now 
answered  your  question  about  the  number  of  observations  to  include  in  your 
data  set  for  analysis.  The  current  trend  line  has  been  continuing  since 
1963,  the  date  of  the  last  change  in  trend.  Therefore,  all  data  prior  to 
1963  must  be  discarded  in  your  analysis.  Pre-1963  data  represents  a  dif¬ 
ferent  trend  line  and  incorporation  of  that  data  in  your  analysis  will  only 
distort  your  results. 
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Having  identified  the  observations  that  reflect  the  current  trend,  you 
must  now  decide  whether  or  not  you  can  reliably  project  that  trend  into  the 
future.  At  this  point  a  caveat  is  in  order  regarding  time  series  analysis . 
Forecasts  based  ug>on  time  series  analysis  are  totally  incapable  of  pre¬ 
dicting  future  changes  in  trend.  They  may  react  to  actual  changes  in  trend 
as  new  observations  are  accumulated  that  reflect  the  change,  a  charac¬ 
teristic  that  we  have  labeled  responsiveness,  but  they  have  no  predictive 
value.  Therefore,  every  time  series  forecast  is  based  on  the  assumption 
that  there  will  be  no  future  changes  in  the  current  trend. 

To  determine  whether  or  not  you  can  reliably  project  the  current  trend 
into  the  future  \s  the  basis  for  your  forecast,  you  must  again  examine  the 
basic  assumptions  underlying  time  series  analysis,  only  this  time  you  must 
speculate  about  the  future.  Will  the  assumptions  remain  valid  from  the 
present  to  the  point  in  time  that  you  are  trying  to  forecast?  And  again 
you  must  rely  upon  the  basic  guidance  that  says  if,  after  a  thorough 
investigation,  you  fail  to  find  evidence  to  the  contrary  accept  the  con¬ 
tinued  validity  of  the  basic  assumptions  as  reasonably  probable ,  and  use 
good  judgment  when  incorporating  the  forecast  into  the  decision  making 
process . 

Once  you  have  decided  which  observations  to  include  in  your  analysis 
you  are  ready  to  begin  your  search  for  the  trend  line.  Your  first  task  is 
to  determine  whether  there  is  an  increasing,  decreasing,  or  zero  trend. 
Suppose  you  had  the  following  data. 


Table  10-4 


A  graph  of  our  data  would  look  as  follows: 

Figure  10-4 
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If  our  graph  of  the  data  revealed  an  obvious  upward  or  downward  trend, 
we  would  have  answered  our  question  and  could  then  proceed  to  the  next  step 
of  our  analysis.  However,  for  our  data  the  trend,  if  it  exists,  is  not 
obvious  (at  least  not  to  me).  Therefore,  we  need  a  statistical  test  that 
will  give  us  a  more  conclusive  answer  when  our  graphic  analysis  fails  us. 
The  test  we  are  going  to  use  relies  on  a  statistic  called  the  Spearman  Rank 
Correlation  Coefficient,  to  which  we  will  assign  the  symbol  RS. 

He  have  already  listed  our  variable  Y(t)  in  chronological  order.  By 
numbering  time  from  1  to  n,  we  have  "ranked"  time  from  the  past  to  the 
future.  He  must  now  rank  our  variable  Y(t)  from  the  smallest  to  the 
largest  value.  Our  rank  assignments  will  also  go  from  1  to  n,  with  the 
smallest  value  of  Y(t)  being  assigned  the  rank  of  ”1",  through  the  largest 
value  of  Y(t)  which  will  receive  a  rank  of  "n".  Our  example  would  look  as 
follows : 

Table  10-5 
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From  the  table  we  need  to  calculate  the  difference  between  the  rank 
assignments  for  time  and  Y(t),  which  we  shall  call  D(t).  Finally,  we  must 
square  each  value  of  D(t)  and  find  the  sum  of  squares,  or  ZD2(t).  Once  we 
have  this  number,  we  are  ready  to  calculate  RS.  The  formula  is: 

RS  =1  -  6*ZD2(t)/n3-n  (Equation  10-4) 


For  our  example,  the  completed  table  would  look  as  follows: 

Table  10-6 


V(t) 

100 

113 
97 
95 

118 

114 
94 

115 
119 


Rank  of  Y(t) 

4 

5 
3 
2 
8 

6 
1 
7 
9 


Time 

1 

2 

3 

4 

5 

6 

7 

8 
9 


D(t) 

3 

3 

0 

-2 

3 

0 

-6 

-1 

0 


D*(t) 

9 

9 

0 

4 

9 

0 

36 

1 

0 


Inserting  E  ( t )  ■  68  and  n  «  9  into  our  equation . 

RS  -  1  -  6*68/93  -  9  -  1  -  408/720  -  1  -  .5667  -  .4333 


ZD2  (t)  -  68 


We  now  know  that  the  Spearman  Rank  Correlation  Coefficient  for  our  data 
is  .4333.  Now  what?  To  determine  whether  or  not  there  is  a  trend,  we  must 
have  a  value  to  which  we  can  compare  our  RS  in  order  to  draw  some  kind  of 
conclusion.  We  will  call  this  value  RS(Critical ) .  A  table  for 
RS (Critical)  is  provided.  Your  table  for  RS( Critical)  is  for  a  one-tailed 
test.  Therefore,  your  hypothesis  is  thai  SS  «  0,  which  means  there  is  no 
trend.  If  RS  >  0,  your  alternative  hypothesis  is  that  there  is  an  upward, 
or  a  positive  trend,  and  if  RS  <  0  your  alternative  hypothesis  is  that 
there  is  a  downward,  or  negative  trend. 

In  our  example,  RS  •  .4333  so  we  ire  testing  the  hypothesis  that  RS  *  0 
against  the  alternative  hypothesis  that  RS  >  0.  The  table  for  RS( Critical) 
is  for  three  levels  of  significance,  a  -  .10,  a  «  .05  and  a  «  .01.  Therefore, 
we  must  decide  at  what  level  of  significance  we  want  to  test  our  hypothe¬ 
sis.  Suppose  we  select  a  ■  .05,  which  is  equivalent  to  a  95%  level  of  con¬ 
fidence.  Then  RS(Critical)  ■  .5833.  If  RS  <  RS ( Critical ) ,  accept  your 
hypothesis.  In  our  example,  .4333  <  .5833,  so  we  must  accept  the  hypothe¬ 
sis  that  RS  ■  0  and  that  there  is  no  trend  in  the  data.  Note,  the  table 
for  RS ( Criteria ) ,  gives  you  values  for  the  positive  side  of  this  probabi¬ 
lity  distribution.  However,  RS  can  take  on  any  value  between  -1  and  +1. 
Therefore,  when  comparing  RS  to  RS(Critical) ,  the  sign  of  RS  should  be 
ignored.  Remember,  however,  that  the  sign  of  RS  determines  your  alter¬ 
native  hypothesis. 
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Spearman's  Rank  Correlation  Coefficient  is  used  to  test  the  hypothesis 
of  no  trend  only  under  the  condition  that  the  trend,  if  one  exists,  is 
monotonic.  In  other  words,  the  observations  used  to  compute  RS  must  meet 
the  following  conditions: 

( 1 )  There  must  be  no  cyclical  effect 

(2)  There  must  be  no  seasonal  effect 

(3)  There  must  be  no  changes  in  trend. 

The  trend  line  must  be  either  continuously  increasing,  continuously 
decreasing,  or  constant. 

Because  of  the  effect  of  random  fluctuations  in  the  data,  the  RS  test 
may  lead  you  to  the  wrong  conclusion  regarding  the  acceptance  or  rejection 
of  your  hypothesis.  The  probability  of  this  occurring  is  measured  by  the 
level  of  confidence  you  select  for  testing  your  hypothesis.  However,  if 
the  three  conditions  specified  above  for  the  observations  are  not  met,  you 
might  as  well  flip  a  coin  to  decide  whether  or  not  there  is  a  trend.  Your 
chances  of  being  right  are  almost  as  good. 

If  there  is  no  trend  in  the  data  our  variable  must  be  fluctuating 
around  a  horizontal  line  with  a  slope  of  zero.  In  other  words,  you  trend 
line  is  Y(t)  *  a.  However,  if  we  determine  that  there  is  a  trend  in  our 
data,  we  must  attempt  to  locate  the  trend  line  in  order  to  project  future 
values  for  our  variable.  To  do  this,  we  need  to  find  the  equation  that 
relates  Y(t)  to  time,  t.  We  have  a  technique  available  to  use  to  find  Y(t) 
-  f{t)  when  f(t)  »  a  +  b*t,  a  straight  line.  The  technique  is  called 
least-squares-best-fit,  hereafter  referred  to  as  LSBF. 

The  equations  for  finding  the  parameters  a  and  b  using  the  LSBF  tech¬ 
nique  when  time  is  the  ingredient  variables  are: 

IY(t)  -  n*a  +  b*Et 

l[Y(t)*t]  -  a*Et  +  b*It2 

Suppose  we  had  the  following  data: 

Table  10-7 


Month (1976)  Y(t) 

Jan  1 1 2 

Feb  127 

Mar  147 

Apr  166 

May  177 

Jun  194 

Jul  217 

Aug  232 

Sep  245 

Oct  268 

Nov  281 

Dec  298 
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Time  is  represented  by  months  of  the  year,  which  makes  it  extremely 
difficult  to  find  r[Y<t)*t],  Et,  and  Et2 •  Therefore,  we  need  to  find  a 
scheme  for  quantifying  time.  In  this  section  we  will  adopt  the  following 
rules.  The  oldest  observation  in  our  data  set  will  be  given  a  value  of 
one,  and  times  periods  will  be  numbered  sequential  in  ascending  order  as 
you  move  toward  the  present.  This  is  not  the  only  scheme  available  for 
quantifying  time  but  it  is  the  only  scheme  that  will  be  applied  in  this 
section.  Our  quantification  of  time  would  then  look  like  Table  10-8. 


Table  10-8 


Y(t) 

ill 

Y(t)*t 

t2 

112 

1 

112 

1 

127 

2 

254 

4 

147 

3 

441 

9 

166 

4 

664 

16 

177 

5 

885 

25 

194 

6 

1164 

36 

217 

7 

1519 

49 

232 

8 

1856 

64 

245 

9 

2205 

81 

268 

10 

2680 

100 

281 

11 

.  3091 

121 

298 

11 

3576 

144 

2464 

78 

18447 

650 

Substituting  EY  -  2464,  Et  -  78,  Et2  -  650,  E[Y(t)*t]  -  18447,  and  n  - 
12  into  our  LSBF  equations  gives  us: 


2464  -  12a  +  78b 
18447  «  78a  +  650b 


Solving  the  above  two  equations  for  a  and  b  we  find  that  a-94.8333  and  b»17. 
We  shall  use  the  symbol  YT  to  represent  our  trjnd  line,  followed  by  the 
parenthesis  (t).  The  symbol  "t"  represents  three  things.  First,  it  is  the 
most  recent  observation  in  our  data  set.  Second,  since  we  have  numbered 
our  observations  sequentially  starting  with  one ,  t  also  equates  to  the 
number  of  observations  in  the  data  set.  Third,  t  represents  the  time 
period  from  which  all  our  forecasts  will  be  made.  Usually,  but  not  always, 
t  will  be  the  current  time  period.  In  our  example,  YT( 12)*94.8333+17*t. 


If  you  remember  our  earlier  symbology,  F(t,t+i)  was  a  forecast  made  in 
time  period  t  for  the  time  period  t+i.  To  designate  a  forecast  based  on  a 
projection  of  the  trend  line,  I  am  going  to  use  the  symbol  FT  followed  by 
the  parenthesis  (t,t+i). 

In  our  example,  the  trend  forecast  would  be  FT( 12, 12+i)  -  94.8333  + 
17*(12+i).  Using  this  equation,  let  us  forecast  the  variable  Y  for  June 
1977,  supposing  we  are  now  in  December  1976.  For  December  1976,  t  *  12. 
June  is  six  periods  into  the  future.  Therefore,  i  -  6  and  t+i  »  18.  Our 
forecast  would  be:  FT(12,18)  -  94.8333  +  17*18  -  94.8333  +  306  -  400.8333 
or  401. 


What  would  our  trend  forecast  be  if  our  test  for  trend  indicates  there 
is  no  trend  in  the  data?  To  answer  this  question,  look  at  the  LSBF 
equations  themselves.  No  trend  means  that  the  slope  of  the  trend  line  is 
zero,  or  b  -  0.  The  first  equation  is  EY(t)  -  n*a  +  b*Et.  If  b  *  0,  then 
EY(t)  *  n*a  or  a  *  EY(t)/n  -  Y(t).  Therefore,  our  forecast  FT(t,t+i)  - 
Y(t)  would  never  change  as  long  as  the  assumptions  of  our  time  series 
analysis  hold  true. 


To  determine  whether  or  not  a  trend  is  nonlinear,  we  must  again  analyze 
the  error  term  E(t)  that  is  obtained  from  our  LSBF  linear  trend  equation. 
Take  the  following  example : 

Table  10-9 


mi 

Month ( 1977) 

t 

139 

Mar 

1 

144 

Apr 

2 

140 

May 

3 

151 

Jun 

4 

165 

Jul 

5 

188 

Aug 

6 

211 

Sep 

7 

246 

Oct 

8 

281 

Nov 

9 

The  first  step  we  would  take  is  to  test  to 
calculating  RS. 

Table  10-10 

see  if  there  is  a  trend 

Y(t)  Rank  t 

d(t) 

d2(t) 

139 

1  1 

0 

0 

144 

2  2 

1 

1 

140 

3  3 

-1 

1 

151 

4  4 

0 

0 

165 

5  5 

0 

0 

188 

6  6 

0 

0 

211 

7  7 

0 

0 

246 

8  8 

0 

0 

281 

9  9 

0 

0 

Ed* (t)  -  2 
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RS  -  1  -  (6*2/(9 ) 3-9)  «1  -(12/720)  -  1  -  .01667  -  .9833 
For  n  ■  9  and  a  *  .01,  FS( Critical)  ■  .7667 

RS  -  .9833  >  RS (Critical)  ■  .7667,  ao  we  reject  the  hypothesis  that  RS 
-  0  and  accept  the  alternative  hypothesis  that  RS  >  0.  We  are  certain  at 
the  99%  level  of  confidence  that  there  is  an  upward  trend. 

Our  next  step  would  be  to  measure  the  linear  trend  by  using  the  LSBF 
equations . 


Table  10-11 


Y(t) 

(t) 

tf 

Y(t)*t 

139 

1 

1 

139 

144 

2 

4 

288 

140 

3 

9 

420 

151 

4 

16 

604 

165 

5 

25 

825 

188 

6 

36 

1128 

211 

7 

49 

1477 

246 

8 

64 

1968 

281 

9 

81 

2529 

1665 

45 

285 

9378 

LSBF:  ZY(t)  -  n*a  +  b*Et 

E[y(t)*t]  -  a*£t  +  b*Et2 

or:  1665  *  9a  +  45b 

9378  ■  45a  +  285b 

Solving  these  two  equations,  we  find  that  a  ■  97.25  and  b  *  17.55. 
Therefore,  YT(9)  -  97.25  +  17.55*t.  We  next  calculate  the  error  term, 
E(YT:9)  -  Y(t)  -  YT(9). 

Table  10-12 


Y(t) 

YT(Mar  77.9) 

E(YT:  Mar  77.9) 

Siqn  +  or 

139 

114.8 

24.2 

+ 

144 

132.35 

11.65 

+ 

140 

149.9 

-9.9 

- 

151 

167.45 

-16.45 

- 

165 

185.00 

-20.00 

- 

188 

202.55 

-14.55 

- 

211 

220.1 

-9.1 

- 

246 

237.65 

8.35 

+ 

281 

255.2 

25.8 

+ 

There  are  two  methods  for  determining  if  the  trend  line  is  nonlinear. 
The  first  method  is  to  look  at  the  signs  (+  or  -)  associated  with  the  error 
term  E(YT).  If  the  signs  are  positive  for  the  end  values  and  negative  for 


the  middle  values/  or  negative  for  the  end  values  and  positive  for  the 
middle  values,  there  is  an  Indication  that  you  have  a  nonlinear  trend.  If 
the  signs  appear  to  be  "randomly”  distributed,  changing  from  positive  to 
negative  and  back  «uid  forth,  your  trend  is  probed) ly  linear.  In  our  example 
the  signs  associated  with  the  end  values  are  positive  and  the  signs  asso¬ 
ciated  with  the  middle  values  are  negative,  indicating  a  nonlinear  trend. 

The  second  method  is  to  plot  the  error  term  against  time.  Since  the 
error  term  takes  on  both  positive  and  negative  values,  your  graph  should  be 
set  up  in  the  following  manner. 


EY(t) 


re  10-5 


If  the  chronological  plot  of  E(YT) looks  like  a  _ ^ "  or  a  "  the 

trend  line  is  probably  nonlinear.  If  a  plot  of  e(t)  crosses  the  t-axis 
several  times  so  the  plot  looks  like  random  fluctuations,  the  trend  is  pro¬ 
bably  linear.  In  our  example,  the  error  terms  plot  line  a  "v _ /"• 

The  following  tables  give  the  relationship  between  the  error  terms  and 
the  nonlinear  form  of  the  trend  line.  The  first  column  specifies  whether 
the  slope  of  the  linear  trend  line  projected  through  the  data  using  LSBF  is 
positive  or  negative.  The  second  column  indicates  whether  the  signs  of  the 
error  terms  are  positive  at  the  ends  and  negative  in  the  middle  (+,-,+)  or 
negative  at  the  ends  and  positive  in  the  middle  (-,+,-).  The  third  column 

indicates  whether  the  error  terms  e{  YT)  look  like  a  "v _ /"  or  like  a  ^ 

when  plotted  against  time.  The  last  column  indicates  how  the  slope  of  the 
nonlinear  trend  is  behaving. 


Table  10-13 


"b"  Sign  of  "E"  Plot  of  ME"  Nonlinear  trend 

(Figure  10-2) 


Positive 

+,-#+ 

Trend  line  C-D 

Trend  line  E-F 

Negative 

Trend  line  I-J 

Trend  line  K-L 

I  an  not  going  to  attempt  to  provide  you  with  the  techniques  necessary  to 
determine  the  equation  for  a  nonlinear  trend  using  LSBF.  Therefore,  if  you 
have  a  trend  line  which  is  nonlinear,  1  give  you  two  choices.  First,  you 
can  draw  a  free-hand  line  through  the  data,  and  attempt,  based  on  a  visual 
analysis,  to  graphically  project  the  line  to  where  you  think  it  appears  to 
be  going.  Second,  you  can  select  the  most  recent  data  points  and,  using 
LSBF ,  fit  a  straight  line  to  the  data  and  project.  If  you  do  not  project 
too  far  out  into  the  future,  and  the  curvature  of  your  nonlinear  trend  line 
is  relatively  flat,  the  error  in  your  projection  should  not  be  excessive. 

The  LSBF  line  is  not  very  sensitive  to  the  addition  of  new  obser¬ 
vations.  This  lack  of  sensitivity  to  new  observations  has  a  twofold 
effect.  First,  our  LSBF  line  will  respond  very  slowly  to  changes  in  trend. 
Second,  our  LSBF  line  will  remain  relatively  unaffected  by  random  fluc¬ 
tuations  in  the  data.  If  we  find  ourselves  in  a  situation  triiere  stability 
in  our  forecast  is  more  important  than  an  ability  to  respond  to  changes  in 
trend,  a  LSBF  trend  line  is  a  reasonable  technique  upon  which  to  base  our 
forecast.  However,  if  the  reverse  is  true  and  the  ability  to  respond  to 
changes  in  trend  gains  greater  relative  importance,  a  LSBF  trend  line  will 
not  meet  our  needs.  What  to  do?  Eliminating  older  observations  as  we  add 
new  observations  increases  the  sensitivity  of  our  forecasting  technique  to 
new  observations  which,  in  turn,  results  in  greater  sensitivity  to  changes 
in  trend.  This  is  the  basic  idea  behind  the  family  of  forecasting  tech¬ 
niques  based  on  moving  averages.  However,  before  we  begin,  we  need  to 
acquaint  ourselves  with  some  new  symbology. 

M1(n,t)  is  a  single  n-period  moving  average  calculated  in  time  period  t. 
M2(n,t)  is  a  double  n-period  moving  average  calculated  in  time  period  t. 


FM1(n,t,t+i)  is  «  single  n-period  moving  average  forecast  made  in  time 
period  t  for  time  period  t+i. 


FM2(n,t,t+i)  is  a  double  n-period  moving  average  forecast  made  in  time 
period  t  for  time  period  t+i . 


A  single  n-period  moving  average  for  time  period  t  is  the  average  value 
of  the  last  n  observations  of  a  data  set  that  ends  at  time  period  t.  As 
you  can  see,  verbally  describing  a  single  n-period  moving  average  is  very 
much  like  describing  a  spiral  staircase  without  using  your  hands.  Perhaps 
we  can  gain  better  insight  into  Ml (n,t)  by  looking  at  the  way  a  single  n- 
period  moving  average  is  calculated.  The  formulas  for  Ml (n,t)  is: 


Ml (n,t)  -  Y(t)  +  Y(t-i)  +  Y(t-2)  +.  .  .+  Y(t-n+1)  (Equation  10-5) 

n 

The  summation  in  the  numerator  starts  with  the  observation  that 
corresponds  to  the  time  period  for  which  we  are  calculating  the  moving 
average.  There  are  also  n  observations  in  the  numerator.  In  our  symbology 
Ml (n,t) ,  t  determines  the  first  observation  to  be  included  in  the  moving 
average,  and  n  determines  the  number  of  observations  to  be  included.  For 
example,  suppose  we  were  to  analyze  the  following  data: 


Table  10-14 

1  74 

2  94 

3  84 

4  71 

5  88 

6  81 

7  77 

8  91 

9  87 

10  80 


Before  calculating  M1(n,t)  we  must  decide  how  many  observations  we  want 
to  include  in  our  moving  average.  In  fancier  language,  n  is  a  decision 
parameter  that  must  be  selected  by  the  forecaster  based  on  factors  that 
will  be  discussed  later.  For  the  purpose  of  our  examples,  let  n  *  3. 

We  cannot  calculate  M1(3,1)  or  M1(3,2)  because  we  do  not  have  a  suf¬ 
ficient  number  of  data  points.  The  first  n  -  period  moving  average  we  can 
calculate  is  Ml (3, 3). 
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Using  equation  10-5  M1<3,3>  -  Y(3)+Y(2)+Y( 1 )  .  84  +  94  +  74  «  252  .  g4 

3  3  3 


To  calculate  Ml (3,4)  we  drop  the  oldest  observation  in  the  numerator, 

Y ( 1 )  -  74,  and  add  the  newest  observation ,  Y(4)  -  71.  Using  equation  3-1, 
M1(3,4)  -  Y(4)+Y(3)+Y(2)  „  88  +  71  +  84  ,  243  _  „3 

3  3  3 

and 

Ml (3,5)  -  Y(5)+Y(4)+Y(3)  88  +  71  +  84  243 

3  *  3  ”3 

I  will  leave  it  to  you  to  calculate  M1(3,t)  for  t  *  6  to  t  *  10.  Your 
final  results  should  look  like  Table  10-15. 

Table  10-15 


M1(3,7) 


1 

74 

- 

2 

94 

- 

3 

84 

84 

4 

71 

83 

5 

88 

81 

0 

81 

80 

7 

77 

82 

8 

91 

83 

9 

87 

85 

10 

80 

86 

The  formula  for  a  single  n-period  moving  average  can  be  modified 
slightly  into  a  format  that  makes  it  more  workable  when  n  is  large.  We 
could  write  Ml ( 3 , 5 )  as  follows . 

M1(3,5)  -  Y ( 5 ) +Y ( 4 ) +Y ( 3 )  Y(5)  Y(4)  Y(3) 

3  "33  3 

Y  (2) 

If  we  both  add  and  subtract  3  from  the  above  equation,  nothing  changes. 
The  two  terms  cancel  each  other  out.  The  equation  would  then  read: 

mk3,5)  -  mi  +  mi  +  mi  +  mi  -  mi 

33333 


However,  the  middle  three  terms  represent  the  3-period  moving  average  for 
time  period  4,  Ml (3,4).  We  can  rewrite  the  equation: 


M1(3,5)  -  mi  +  Ml  (3,4)  -  Hi!  -  M1(3,4)  +  YCS)-Y(2) 
3  3  3 


We  can  calculate  Ml (3, 5)  by  adding  to  Ml (3, 4)  one  third  of  the  difference 
between  the  new  added  observation,  Y(5),  and  the  old  deleted  observation, 
Y( 2) . 


In  general 


M(n,t)  -M(n,t-1)  +  X.(*>.  - 


Equation  10-6) 


In  our  example  we  calculated  Ml (3,3)  “  84.  Using  equation  10-6. 
M1(3,4)  -  Ml (3/3)  +  ,  84  +  71  ~  74  ■  84  -  1  -  83  and 


M  (3,5)  -  Ml ( 3 ,4)  + 


Y(5)-Y(2) 


83  + 


88  -  94 


83  -  1 


81 


When  n  is  small  it  makes  little  difference  whether  you  use  equation  10-5 
or  equation  10-6  to  calculate  your  moving  average.  However,  as  n  gets 
large,  use  of  equation  10-6  can  save  you  a  tremendous  amount  of  time  and 
effort.  If  you  don't  believe  me,  calculate  M1(6,t)  for  t  -  6  to  t  -  10 
from  the  date  in  Table  10-14  using  both  equations.  You  will  become  a  believer. 


We  now  know  how  to  calculate  a  single  n-period  moving  average  for  time 
period  t.  However,  suppose  we  are  in  time  period  t  and  we  want  to  forecast 
time  period  t+i.  What  do  we  do?  If  you  are  going  to  forecast  using  a 
single  n-period  moving  average,  the  relationship  between  the  forecast  and 
the  moving  average  is  a  simple  one. 

FMI(n,t,t+i)  -  Ml (n,t)  (Equation  10-7) 


Our  single  moving  average  forecast  for  time  period  t+i  is  nothing  more 
than  our  single  moving  average  for  time  period  t*  Going  back  to  Table 
10-15,  suppose  we  wanted  to  make  a  forecast  for  two  periods  into  the 
future.  Then  our  forecast  interval,  i  «  2.  Our  forecasts  FMI(3,t,t+2)  and 
the  error  in  our  forecasts  EFMI:3,t,t+2)  are  shown  in  Table  10-16. 


Table  10-16 


t 

Y(t) 

Ml(3,t) 

FMI(3,t,t+2) 

EFMI:  3,' 

T 

74 

- 

- 

- 

2 

94 

- 

- 

- 

3 

84 

84 

- 

- 

4 

71 

83 

- 

- 

5 

88 

81 

84 

4 

6 

81 

80 

83 

-2 

7 

77 

82 

81 

-4 

8 

91 

83 

80 

11 

9 

87 

85 

82 

5 

10 

80 

86 

83 

-3 

The  sign  of  our  error  terms  appears  to  be  random.  Our  statistics  for 
measuring  forecasting  accuracy  would  be: 


Jinmi  .  2|  .  4-833 

n  6 

s  , |/L  -  V31.8333  -  5.642 


*> 
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When  calculating  our  statistics  we  let  n  ■  6  because  there  are  only  six 
periods  for  which  we  have  forecasts  even  though  ten  observations  were  used 
in  making  our  calculations. 

Is  there  a  trend  in  our  data?  To  answer  this  question  we  must  calcu¬ 
late  the  Spearman  Rank  Correlation  Coefficient. 


Table  10-17 

t 

Y(t) 

Rank  Y(t) 

D 

1 

74 

2 

-1 

1 

2 

94 

10 

-8 

64 

3 

84 

6 

-3 

9 

4 

71 

1 

3 

9 

5 

88 

8 

-3 

9 

6 

81 

5 

1 

1 

7 

77 

3 

4 

16 

8 

91 

9 

-1 

1 

9 

87 

7 

2 

4 

12. 

80 

_4 

_6 

36 

150 

RS  -  1  - 

6*150 

=  1  -  1  -  .9091 

*  .0909 

10d-10 

990 

RS( Critical)  for  n=*10  and  an  a  *  .05  is  .5515.  Since  RS  <  RS( Critical)  we 
can  accept  the  hypothesis  that  RS  *  0  and  be  95%  confident  of  having  made 
the  right  choice.  We  are  95%  certain  that  there  is  no  trend  in  the  data. 

Would  it  have  made  any  difference  in  our  forecasts  if  there  had  been 
a  trend  in  the  data?  To  answer  this  question  let  us  look  at  another  set  of 
data. 


Table  10-18 

t  Y(t) 

1  111 

2  118 

3  122 

4  120 

5  124 

6  128 

7  123 

8  130 

9  134 

For  the  data  in  Table  10-18,  RS  *  .9333.  RS(Critical)  for  n  *  9  and 
a  *  .01  is  .7667.  Since  RS  is  positive  and  greater  than  RS ( Critical ) ,  we 
reject  the  hypothesis  that  RS  ■  0  and  accept  the  alternative  hypothesis  that 
RS  >  0.  We  are  99%  certain  that  there  is  an  increasing  trend  in  the  data. 


Let  us  calculate  a  3-period  single  moving  average  forecast  for  two 
periods  into  the  future. 

Table  10-19 


t 

Y(t) 

Ml ( 3 ,t) 

FM1 ( 3,t,t+2) 

E( FM1 :  3,t,t+2) 

1 

111 

- 

- 

- 

2 

118 

- 

- 

- 

3 

122 

117 

- 

- 

4 

120 

120 

- 

- 

5 

124 

122 

117 

7 

6 

128 

124 

120 

8 

7 

123 

125 

122 

1 

8 

130 

127 

124 

6 

9 

134 

129 

125 

9 

The  error  term  for  our  forecast  is  always  positive.  This  means  our 
forecasts  are  consistently  too  low.  What  is  the  source  of  this  bias? 
Because  of  the  upward  trend  in  the  data  the  observations  used  to  calculate 
the  single  moving  average  are  usually  smaller  than  the  observations  we 
expect  to  see  in  the  future.  Therefore,  when  the  moving  average  is  used  as 
a  forecast  of  the  future,  we  would  expect  our  forecast  to  usually  be  too 
low.  If  our  trend  line  was  decreasing  over  time,  our  single  moving  average 
forecast  would  usually  be  too  high,  and  our  terms  would  be  negative. 

The  magnitude  of  the  bias  in  our  forecasts  depends  on  n,  i,  and  the 
slope  of  the  trend  line.  The  steeper  the  slope  of  the  trend  line,  the 
greater  the  bias  in  your  forecast.  The  greater  the  number  of  observations 
in  your  single  moving  average  calculation,  the  greater  the  bias  in  your 
forecast.  The  greater  the  number  of  periods  out  into  the  future  you  are 
trying  to  forecast,  the  greater  the  bias  in  your  forecasts.  Conversely, 
the  flatter  the  slope  of  the  trend  line,  the  smaller  the  value  for  n,  and 

the  smaller  the  value  for  i,  the  smaller  the  bias  in  your  forecast. 

However,  if  you  have  a  trend  in  your  data,  a  single  moving  average  forecast 

will  always  produce  biased  forecasts* 

If  you  have  a  trend  in  your  data  you  may  still  be  able  to  forecast  with 
a  single  moving  average  provided  that  the  amount  of  the  bias  is  relatively 
consistent.  In  other  words,  the  bias  in  the  forecast  must  be  relative 
stable  over  time.  The  forecast  error  term  is  a  combination  of  the  amount 
of  bias  in  the  forecast  and  random  fluctuations  in  the  data.  If  there  is 
no  bias  in  the  forecast,  the  error  term  measures  only  the  random  fluc¬ 
tuations,  and  the  sum  of  the  error  terms  should  equal  zero.  However,  if 
there  is  bias  in  the  forecast  the  sum  of  the  error  terms  should  be  positive 
or  negative  depending  on  whether  the  bias  is  positive  or  negative.  Our 
beat  estimate  of  the  average  valuta  of  our  bias,  therefore,  is  the  average 
value  of  our  error  terras,  or  E.  Once  we  have  a  measure  of  our  forecast 
bias  we  can  adjust  our  forecast  by  the  amount  of  the  bias. 

31 

For  example,  using  the  data  from  Table  10-19,  11-5  3  6.2.  Therefore, 

our  forecast  adjusted  for  bias  would  be  FM1(3,t,t+2)  +  6.2.  Our  forecasts 
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fl 


would  then  look  like  Table  10-20. 


Table  10-20 


t 

Y(t) 

FM  1(3,t,t+2)+6.2 

E(FM1 :  3 , t , 

1 

111 

- 

2 

118 

- 

3 

122 

- 

4 

120 

5 

124 

123.2 

.8 

6 

128 

126.2 

1.8 

7 

123 

128.2 

-5.2 

8 

130 

130.2 

-.2 

9 

134 

131.2 

3.8 

Note  that  by  adjusting  our  single  moving  average  forecast  by  E,  the  sum  of 
the  resulting  error  terms  will  always  add  to  zero,  a  condition  that  implies 
no  forecasting  bias. 

Since  we  cannot  rely  on  our  single  moving  average  forecasts  for  all 
cases  when  there  is  a  trend  in  the  data,  what  are  we  to  do?  If  the  trend 
is  linear  you  can  still  use  moving  averages  for  forecasting,  but  instead  of 
using  single  moving  averages  you  must  use  double  moving  averages.  What  is 
a  double  moving  average?  A  double  moving  average  is  a  moving  average  of 
single  moving  averages.  Remember  the  formula  for  a  single  moving  average. 


Ml (n,t)  -  Y(t)  +  Y(t-1)  +  Y(t-2)  + 

n 


+  Y( t-n+1 ) 


(Equation  10-5) 


If  you  were  to  substitute  single  moving  averages  in  place  of  individual 
observations  in  the  above  equation,  you  would  have  a  double  moving  average. 

M2(n,t)  -  Ml (n,t)  +  M1(n,t-1)  +  M1(M,t-2)  +♦..+  M1(n, t-n+1)  (Equation  10-8) 

n 

To  see  how  we  would  calculate  double  moving  averages,  let  us  look  again  at 
the  data  used  to  calculate  the  single  moving  averages  in  Table  10-19. 

Table  10-21 


t 

Y(t) 

Ml  ( 3 

1 

111 

- 

2 

118 

- 

3 

122 

117 

4 

120 

120 

5 

124 

122 

6 

128 

124 

7 

123 

125 

8 

130 

127 

9 

134 

129 
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Our  single  moving  average  is  a  3-period  moving  average,  so  our  double 
moving  average  must  also  be  a  3-period  moving  average.  M2 (3, 5)  is  the 
first  double  moving  average  we  can  calculate,  since  time  period  5  is  the 
first  time  period  for  which  we  have  three  single  moving  averages.  Using 
Equation  10-8. 

M2 ( 3 , 5 )  -  Ml (3,5)  +  M1(3,4)  +  MH3.3)  -  122  +  120  +  117  -  359  -  119.667 

3  3  3 

To  calculate  Ml (3,6)  we  need  to  change  the  summation  in  the  numerator  by 
deleting  Ml (3,3)  and  adding  Ml (3,6).  The  equation  for  M2 (3, 6)  then 
becomes : 

M2 (3, 6)  *  Ml (3,6)  +  Ml (3,5)  +  M1(3,4)  -  124  +  122  +  120  -  366  =  122 

3  3  3 

I  will  leave  it  to  you  to  calculate  M2(3,t)  for  t  *  7  to  t  *  10.  Your 
results  should  look  like  Table  10-22. 


t 

Y(t) 

Table  10-22 

M1(3,t) 

M2 ( 3 , t ) 

1 

111 

- 

- 

2 

118 

- 

- 

3 

122 

117 

- 

4 

120 

120 

• 

5 

124 

122 

119.667 

6 

128 

124 

122.000 

7 

123 

125 

123.667 

8 

130 

127 

125.333 

9 

134 

129 

127.000 

Equation  10-5  can  also  be  used  to  calculate  double  moving  averages  by 
changing  the  individual  observations  to  single  moving  averages. 

M2(n,t)  «  M2(n,t-1)  +  MKn,t)_^  Ml_(nft  n )  (Equation  10-9) 

For  example,  the  double  moving  averages  in  Table  10-21  could  have  been 
calculated  as  follows: 

M2 (3,6)  -  119.667  +  124~117  -  119.667  +  2.333  -  122 
and 

125-120 

M2 1  3,7)  »  122  +  - - - -  122  +  1.667  -  123.667 


Again,  for  a  luge  n,  equation  10-9  can  save  you  substantial  com¬ 
putational  time  compared  to  equation  10-8. 


Two  other  observations  are  worth  noting.  First,  it  takes  more  data 
points  to  calculate  a  double  moving  average  than  it  does  to  calculate  a 
single  moving  average.  A  n-period  single  moving  average  requires  a  minimum 
of  n  observations,  while  a  n-period  double  moving  average  requires  a  mini¬ 
mum  of  2*n-1  observations. 

Second,  when  there  is  a  trend  in  the  data  the  double  moving  average 
will  lag  behind  the  trend  line  even  more  than  the  single  moving  average. 

For  example,  you  can  see  in  Table  10-22  that  when  there  is  an  increasing 
trend,  M1(n,t)  lags  behind  the  trend.  Furthermore,  M1(n,t)  becomes  smaller 
as  you  move  backward  in  time.  When  calculating  M2(n,t)  the  numerator  would 
consist  of  the  sum  of  the  most  recent  single  moving  average  and  n-1  earlier 
single  moving  averages  smaller  in  value.  Consequently,  the  average  of 
these  single  moving  averages,  M2(n,t),  would  be  smaller  than  the  most 
recent  single  moving  average,  M1(n,t).  When  the  trend  is  increasing 
M2(n,t)  <  M1(n,t).  Conversely,  when  the  trend  is  decreasing  M2(n,t)  > 
M1(n,t).  When  would  M2(n,t)  =*  M1(n,t)?  When  there  is  no  trend  and  Ml(n,t) 
and  M2(n,t)  are  both  measuring  only  the  random  fluctuation  in  the  data. 

When  we  based  our  forecast  on  a  single  moving  average  we  used  the  rela¬ 
tionship  FMI(n,t,t+i)  =*  M1(n,t).  However,  we  saw  that  single  moving  avera¬ 
ges  gave  us  biased  forecasts  when  our  data  contained  a  linear  trend. 

The  equation  for  a  straight  line  is  Y  -  a+b*x.  Since  we  are  trying  to 
forecast  a  linear  trend,  it  would  be  reasonable  to  assume  that  the  rela¬ 
tionship  between  our  double  moving  average  forecast  and  our  double  moving 
average  is  also  linear.  In  fact,  a  double  moving  average  forecast  depends 
on  both  the  double  moving  average  and  the  single  moving  average.  The  rela¬ 
tionship  is: 

FM2(n,t,t+i)  •  a(t)+b(t)*i  ( Equation  10-10) 

where 

a(t)  *  2*M1(n,t)  -  M2(n,t)  (Equation  10-11) 

a(t)  *  — [M1(n,t)  -  M2(n,t)]  (Equation  10-12) 


The  double  moving  average  forecast  is  a  linear  trend  line  where  the 
parameters  a(t)  and  b(t)  are  determined  by  the  values  of  M1(n,t)  and 
M2(n,t) . 

Using  the  values  calculated  in  Table  10-22,  our  forecasting  equation  for 
time  period  t*5  would  be: 

a(t)  -  2*M1 (3,5)  -  M2 ( 3 , 5 )  -  2*122  -  199.667  -  124.333 

b(t)  -  — [Ml  (3,5)  -  M2(3,5)l  -  _JL  [122  -  199.667]  -  2.333 
n-1 L  J  3-i 


FM2 (3,5, 5+i )  -  124.333  +  2.333*i 


Using  the  forecasting  equation,  our  forecast  for  time  period  9,  4  time 
periods  into  the  future,  would  be  FM2(3,5,9)  -  124.333  +  2.333*4  -  133.665. 

Our  forecasting  equations  for  time  period  t  ■  6  would  be: 

a(t)  -  2*124  -  122  -  126 

b(t)  -  *  [124  -  122]  -  2 

FM2 (3,6, 6+i )  -  126+2*i 

Our  forecast  of  time  period  t  ■  9  from  time  period  t  *  6  would  be 
PM2( 3,6,9)  «  126+2*3  -  132. 

As  we  move  closer  and  closer  to  time  period  t  *  9  our  forecast  will 
continually  change  as  we  adjust  our  estimates  for  the  linear  parameters  a 
and  b  for  the  new  observations  to  our  data. 

What  would  happen  if  we  were  to  base  our  forecasts  on  double  moving 
averages  when  the  data  actually  contains  no  trend?  To  answer  this 
question,  let  us  look  at  the  equations  for  a(M2)  and  b(M2). 

a(t)  -  2*M1 (n,t)  -  M2(n,t)  (Equation  10-11) 

b(t)  -  jjTj-  *[Ml(n,t)  -  M2(n,t)]  (Equation  10-12) 

If  there  is  no  trend  in  the  data,  M1(n,t)  and  M2(n,t)  should  theoreti¬ 
cally  be  equal.  In  actual  fact  they  probably  will  not  be  exactly  equal 
because  of  the  effects  of  random  fluctuations  in  the  data.  Assuming  that 
Ml(n,t)  does  equal  M2(n,t)  what  happens  to  Equation  10-11  and  10-12.  Equation 
10-12,  the  equation  for  b(t),  consists  of  a  multiplication  of  two  numbers, 
one  of  which  is  the  difference  between  M1(n,t)  and  M2(n,t).  If  Ml(n,t)  - 
M2(n,t),  then  M1(n,t)  -  M2(n,t)  -  0  and  Equation  10-12  will  equal  zero.  The 
trend  line  will  have  no  slope,  which  is  consistent  with  our  initial  state¬ 
ment  that  the  data  contained  no  trend.  Equation  10-11,  the  equation  for 
a(t),  consists  of  the  difference  between  twice  M1(n,t)  and  M2(n,t).  If 
M1(n,t)  *  M2(n,t),  then  substituting  M1(n,t)  for  M2(n,t)  in  the  equation 
results  in  a(t)  ■  M1(n,t).  Since  b(t)  *  0,  the  equation  for  a  double 
moving  average  forecast  becomes: 

FM2(n,t,t+i)  -  a(t)  +  b(t)  *i  -  a(t)  -  M1(n,t)  -  FM1(n,t,t+i) 

When  there  is  no  trend  in  the  data,  our  double  moving  average  forecast 
degenerates  into  a  single  moving  average  forecast. 

If  there  is  an  increasing  trend  in  the  data,  then  M1(n,t)  >  M2(n,t)  and 
M1(n,t)  -M2(n,t)  will  be  positive.  Consequently,  b(t)  will  be  positive. 
Conversely,  if  there  is  a  decreasing  trend  in  the  data  then  Ml(n,t)  <  M2(n,t) 
and  M1(n,t)  -  M2(n,t)  will  be  negative.  Consequently,  b(t)  will  be  nega¬ 
tive.  The  relationship  between  M1(n,t)  and  M2(n,t),  determined  by  the 
direction  and  magnitude  of  trend  present  in  the  data,  in  turn  determines 
the  sign  of  the  slope  of  the  double  moving  average  forecast  equation. 
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If  we  rewrite  the  equation  for  a(t)  as  a(t)  -  M1(n,t)  +  M1(n,t)  -  M2(n,t) 
we  can  better  see  the  relationships  between  a(t)  and  the  moving  averages.  If 
the  trend  is  increasing  M1(n,t)  will  fall  behind  the  trend  line  in  value. 

But  M1(n,t)  -  M2(n,t)  corrects  for  the  lag.  Conversely,  if  the  trend  line 
is  decreasing,  Ml(n,t)  will  exceed  the  trend  line  in  value.  But  M1(n,t)  - 
M2(n,t)  will  be  negative,  and  adding  this  negative  value  to  M1(n,t) 
corrects  for  the  lag. 


If  we  were  using  our  forecast  equation  FM2(n,t,t+i)  ■  a(t)  +  b(t)  *i  to 
forecast  the  present,  i  would  equal  zero  and  FM2(n,t,t)  -  a(t).  a(t)  is  a 
double  moving  average  forecast  of  the  present  from  the  present.  As  we  move 
through  time,  a(t)  would  have  to  be  adjusted  to  account  for  the  trend  in 
our  data.  The  amount  of  the  adjustment  is  determined  by  the  relationship 
between  m1(n,t)  and  M2(n,t).  a(t)  represents  the  y- intercept  of  a  straight 
line  equation  as  the  y-axis  is  being  shifted  along  the  time  axis  to  repre¬ 
sent  the  passage  of  time.  Figure  10-6  depicts  the  phenomenon  of  our  moving 
a(  t) . 


Figure  10-6 
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Figure  10-6  also  implies  that  as  the  present  moves  forward  in  time  by 
one  period,  the  change  in  a(t)  should  equal  the  slope  of  the  line.  In 
other  words,  a(Mar)  “  a (Feb)  +  b  and  a(Apr)  *  a (Mar)  +  b.  This  phenomenon 
occurs  because  we  are  always  forecasting  from  the  present;  the  present 
always  corresponds  to  i*o  in  our  double  moving  average  forecast  equation. 


When  we  applied  our  single  moving  average  forecast  to  data  with  a 
linear  trend  our  forecast  was  consistently  biased.  A  single  moving  average 
forecast  assumes  a  trend  line  with  a  slope  of  zero.  Likewise ,  our  double 
moving  average  forecast  assumes  a  linear  trend  line.  If  our  trend  line  is 
nonlinear,  our  double  moving  average  forecast  will  also  be  biased. 

Furthermore,  our  double  moving  average  forecast  will  move  further  and 
further  away  from  the  nonlinear  trend  line  as  the  forecast  interval 
increases.  We  do  not  have  a  forecasting  technique  employing  moving  avera¬ 
ges  that  can  be  used  to  forecast  nonlinear  trends.  However,  you  can  use  a 
double  moving  average  forecast  for  a  nonlinear  trend  provided  that:  (a)  your 
forecast  interval  is  relatively  short,  and  (b)  the  curvature  of  your  nonlinear 
trend  line  is  relatively  flat.  Under  these  conditions  a  straight  line  can  be 
used  as  a  reasonable  approximation  of  a  nonlinear  trend. 

Throughout  this  section  we  have  been  assigning  values  for  n  on  a  rather 
arbitrary  basis.  We  had  stated  earlier  that  the  number  of  observations  to 
include  in  the  calculation  of  the  moving  average  was  a  decision  parameter 
for  the  forecaster.  Now  we  need  to  ask,  what  is  the  criteria  for  selecting  n? 

We  defined  flexibility  as  the  capability  of  a  forecasting  technique  to 
respond  to  changes  in  the  primary  cause  and  effect  relationships,  i.e., 
changes  in  trend.  For  a  forecasting  technique  to  be  flexible,  it  would 
have  to  be  responsive  to  the  most  recent  observations. 

Moving  average  forecasts  are  flexible  because  the  moving  average  fore¬ 
cast  is  continuously  discarding  older  data  and  emphasizing  new  obser¬ 
vations.  The  faster  old  data  is  discarded,  the  greater  the  influence  new 
observations  will  have  on  the  calculation  of  the  forecasts,  and  the  more 
flexible  our  moving  average  forecast  will  be.  Therefore,  the  criteria  of 
flexibility  seems  to  dictate  that  old  data  should  be  discarded  as  rapidly 
as  possible  as  new  observations  we  entered  into  the  data  bank.  In  other 
words,  n  should  be  as  small  as  possible.  However,  we  also  saw  that  a  fore¬ 
casting  technique  that  is  responsive  to  new  observations  will  also  be  very 
sensitive  to  random  fluctuations  in  the  data.  There  is  a  tradeoff.  If  you 
want  to  be  able  ?o  respond  quickly  to  changes  in  trend,  you  have  to  be 
willing  to  accept  relatively  unstable  forecasts.  If  you  want  stable  fore¬ 
casts  relatively  unaffected  by  random  fluctuations  in  the  data,  you  should 
select  a  large  n. 

The  value  you  select  for  n  will  depend  on  three  things:  (1)  your 
expectation  that  the  future  will  bring  a  change  in  trend,  (2)  the  cost  or 
loss  to  you  if  you  fail  to  forecast  a  change  in  trend,  and  (3)  your  ability 
to  tolerate  instability  in  your  forecasts. 

As  you  may  have  come  to  suspect,  there  is  no  one  right  answer  to  the 
problem  of  selecting  n.  The  higher  your  expectation  that  the  future  will 
bring  a  change  in  trend,  the  greater  your  risk  of  loss  if  you  fail  to 
respond  to  a  change  in  trend,  and  the  greater  your  tolerance  for  forecasts 
subject  to  random  fluctuations,  the  smaller  the  n  you  can  select.  However, 
if  the  trend  line  does  not  change,  using  a  small  n  may  result  in  larger 
forecasting  errors  because  of  the  influence  of  random  fluctuations  in  the 
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data.  Your  n  must  not  be  bo  small  that  every  time  you  add  a  new  obser¬ 
vation  your  forecast  is  subject  to  wide  oscillations  between  extreme 
values.  Alternatively,  your  n  must  not  be  so  large  that  a  series  of  new 
observations  identifying  a  new  trend  would  have  little  impact  on  your  fore¬ 
cast.  A  rule  of  thumb  is  than  n  should  be  no  smaller  than  and  no 
larger  than  the  smallest  n  required  to  meet  your  minimum  requirement  for 
stability  in  your  forecasts. 

Two  essential  characteristics  of  a  moving  average  forecast  are:  (1)  the 
forecast  is  based  on  a  limited  number  of  observations,  and  (2)  every  obser¬ 
vation  has  an  equal  influence  upon  the  forecast  because  every  observation 
is  given  equal  weight.  Note,  it  is  the  characteristic  of  equal  weighting 
of  the  observations  that  prevents  us  from  increasing  the  number  of  obser¬ 
vations  without  a  corresponding  loss  of  responsiveness  in  our  forecasts. 

What  we  need  is  a  systematic  scheme  for  assigning  unequal  weights  to  our 
observations  in  a  manner  that  allows  us  to  use  more  data  as  the  basis  for 
our  forecast  without  losing  responsiveness. 

One  possible  mathematical  scheme  for  assigning  weights  is  called 
exponential  smoothing.  Exponential  smoothing  is  a  technique  for  assigning 
exponentially  declining  weights  to  the  observations  to  produce  an  exponen¬ 
tial  average.  Therefore,  an  exponential  average  is  a  weighted  average  that 
employs  exponential  freighting.  We  will  discuss  single  and  double  exponen¬ 
tial  averages  and  their  use  in  forecasting.  Along  with  these  new  tech¬ 
niques  we  need  to  introduce  some  new  symbology. 

c  is  the  exponential  smoothing  constant. 

E1(c,t)  is  a  single  exponential  average  calculated  for  time  period  t 
with  smoothing  constant  c. 

E2(c,t)  is  a  double  exponential  average  calculated  for  time  period  t 
with  smoothing  constant  c. 

FE1(c,t,t+i)  is  a  single  exponential  forecast  made  in  time  period  t  for 
time  period  t+i  using  smoothing  constant  c. 

FE2(c,t,t+i)  is  a  double  exponential  forecast  made  in  time  period  t  for 
time  period  t+i  using  smoothing  constant  c. 

We  stated  in  the  introduction  to  this  chapter  than  an  exponential 
average  is  a  weighted  average  that  employs  exponential  weights.  A  weighted 
average  of  two  observations  would  be  TJ7"W( 1 )*Y( 1 )+W(2)*Y(2) ,  where  W(1)+W(2)*1 
The  equations  for  all  exponential  averages  are  equivalent  to  this  two- 
observation  weighted  average. 

Let  us  look  first  at  the  equation  for  a  single  exponential  average. 

Two  substitutions  are  required.  First,  substitute  the  observed  value  of 
your  variable  for  the  time  period  in  which  you  are  calculating  the  exponen¬ 
tial  average  in  place  of  Y(1).  Let  Y(1)-Y(t).  Next,  substitute  the  single 
exponential  average  for  the  previous  time  period  in  place  of  Y(2).  Let 


Y(2)«E1(c,t-1 ) .  What  about  the  weights?  Let  W(1)-c,  our  exponential 
smoothing  constant.  Since  your  weights  should  sum  to  one,  let  W(2)*1- 
W(1)-1-c.  Our  weighted  moving  average  equation  now  becomes  the  equation 
for  a  single  exponential  average. 

B1(c,t)«c*Y(t)+( 1-c)*El(c,t-1 )  (Equation  10-1.,) 

A  single  exponential  average  is  a  weighted  average  of  our  most  recent 
observation  and  the  previous  single  exponential  average. 

But  where  is  the  exponential  weighting?  There  are  no  exponential 
weights  in  Equation  10-13.  To  observe  the  exponential  weighting  present  in 
Equation  10-13  we  must  first  expand  the  equation  by  several  terms.  Our  ori¬ 
ginal  equation  is: 

E1(c,t)-c+Y(t)+(1-c)*El(c,t-1) 

But  Equation  10-13  tells  jus  that: 

El (c,t-1 )*c*Y(t-1 )+( 1-c)*E1 ( c, t-2) 

Substituting  into  the  first  equation  we  get 

El(c,t)-c*Y(t)+(1-c)* [c*Y(t-1)+(1-c)*El(c,t-2)J 


or 

E1(c,t)-C*Y(t)+C*( 1-c)*Y(t-1 )+( 1-c) 2*E1 (c,t-2) 
But  Equation  10-lJ  tells  us  that: 


E1(c,t-2)-c*Y(t-2)+(1-c)+El(c,t-3) 


I 


Substituting  again  we  get 


El(c,t)«c*Y(t)+cM  1-c)*Y(t-1)+(1-c)2*  [  c*Y(t-2)  +  El(c,t-3)j  or 
E1(c,t)-c*Y(t)+c*( 1-c)*Y(t-1)+c*( 1-c)2*Y(t-2)+(1-c)3*E1(c,t-3) 


o 

Since  (1-c)  *1,  we  can  imagine  that  the  coefficient  for  Y(t)  is  really 

c*( 1-c)  .  Making  this  assumption.  Table  10-23  presents  each  of  the  obser¬ 
vations  in  the  above  equation  and  its  corresponding  coefficient. 


Table  10-23 


Observation 


Coefficient 


Y(t)  c*(1— c)° 

Y(  t-1  )  C*(1-C)J 

Y(t-2)  c*(1-c)2 


I 


,1 
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Can  you  guess  what  the  coefficient  for  observation  Y(t-3)  would  be?  From 
thg  pattern  which  appears  in  our  coefficients,  your  guess  should  be  c*(1- 
c).  If  you  were  to  substitute  Equation  10-13  for  El (c,t-3)  into  the  above 
equation  you  would  get 

El(c,t)«c*Y(t)+c*(1-c)*Y(t-1)+c*(1-c)2*Y(t-2)+c*(1-c)3*Y(t-3)+(1-c)4*E1(c,t-4) 


Your  guess  would  have  been  correct.  In  fact,  the  coefficient  for  the 
general  observation  Y(t-k)  in  the  above  equation  would  be  c*(1-c)^.  As  you 
can  see,  the  weight  for  each  observation  changes  because  of  changes  in  the 
value  of  the  exponent,  k.  Hence,  exponential  weighting.  Each  time  we 
expanded  our  equation,  the  last  term  always  turned  out  to  be  an  exponential 
average.  We  could  continue  to  substitute  Equation  10-13  for  the  last  term 
and  expand  our  equation  ad  infinitum.  In  theory,  the  infinite  sum  that 
constitutes  a  single  exponential  average  is 
« 

El(c,t)“  £  c*( 1-c)k*Y(t-k)  (Equation  10-14) 

k=>o 

However,  as  a  practical  matter,  an  infinite  series  of  observations  is 
an  impossibility.  Therefore,  we  must  modify  our  equation  to  fit  the 
situation  where  we  have  only  n  observations. 
n-1 

El(c,t)»  £  c*< 1-c)k*Y(t-k)+(1-c)n*E1(c,t-n)  (Equation  10-15) 

k-o 


To  calculate  the  last  term  in  Equation  10-15,  El(c,t-n),  we  need  to  know 
both  Y(t-n)  and  E1(c,t-n-1).  Since  our  data  set  runs  from  Y(t)  to  Y(t- 
n+1),  we  do  not  know  the  value  of  either  of  the  above  two  terms. 

Therefore,  we  cannot  calculate  El(c,t-n).  We  need  to  have  some  value  for 
El(c,t-n),  however,  in  order  to  start  the  exponential  smoothing  process. 
Where  will  our  value  come  from? 


The  solution  to  our  problem  is  to  assume  a  value  for  E1(c,t-n),  and  we 
have  two  alternatives  upon  which  to  base  our  assumption.  Alternative  one 
is  to  let  our  exponential  average  for  time  period  t-n  equal  our  observation 
for  time  period  t-n+1,  or  El (c,t-n)-Y( t-n+1 ) .  Equation  10-13  tells  us  that 

El (c,t-n+1 )»c*Y( t-n+1 )+( 1-c)*E1(c,t-n) 

If  we  let  El ( t, t-n) *Y( t-n+1) ,  then  E1(c, t-n+1'  will  also  equal  Y(t-n+1). 
This  will  become  the  starting  point  for  our  exponential  smoothing  process. 
Alternative  two  would  be  to  set  our  initial  exponential  average  equal  to 
?(t).  To  illustrate  both  alternatives,  let  us  look  at  the  following  data 
set. 

Table  10-24 

t  Y(t) 

1  103 

2  119 

3  117 

4  101 

5  110 
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Let  us  arbitrarily  sat  c-.20.  Under  our  first  assumption  E1( .2, .0)-Y( 1)-103. 
Than  E1( .2/ 1)-( .2)*( 103)+(8)#( 103)-103.  Which  is  what  wa  knew  would  happen 
and, 

El(.2,2)-(.2)*(119)+(.8)*(103.00)-106.20 
B1(.2,3)-(.2)*(117)+(.8)*(106.20)-108.36 
E1(.2,4)-(.2)*( 101 )+(.8)*{ 108.36)-106.89 
B1(.2,5)-(.2)*( 110)+(.8)*(106.89)-107.51 

_  550 

Under  our  second  assumption ,  Y (t)  -  ——  -  no  -  El(c,t-n) . 


Therefore:  E1( .2, 1)»( .2)*( 103)+( .8)*( 110J-108.60 

B1(.2,2)-(.2)*(119)+(.8)*{108.60)-110.68 
E1(.2,3)«(.2)*(117)+(.8)*(110.68)-111.94 
E1(.2,4)-(.2)*(101)+(.8)*(11 1 .94)* 109. 75 
E1(.2,5)-(.2)*( 110)+(.8)*( 109.75 )*109. 80 


Table  10-25  contains  the  single  exponential  averages  calculated  under  the 
two  assumptions  and  the  difference  between  the  two  averages. 

Table  10-25 

El(.2,0)-103  E1(.2,0)-110 


t 

Y(t) 

E1(.2.t) 

E1(.2,t) 

Difference 

T 

103 

103.00 

108.60 

5.60 

2 

119 

106.20 

110.68 

4.48 

3 

117 

108.36 

111.94 

3.58 

4 

101 

106.89 

109.75 

2.86 

5 

110 

109.80 

107.51 

2.29 

A  look  at  the  differences  between  the  two  single  exponential  averages 
should  reveal  two  key  points.  First,  the  alternative  you  select  for 
assuming  a  value  for  El (.2,0)  makes  a  difference  in  the  calculated  values 
of  your  exponential  averages.  Second,  the  difference  between  the  two  expo¬ 
nential  averages  decreases  as  you  move  further  and  further  away  from  your 
starting  point.  Let  us  look  again  at  Equation  10-15. 
n-1 

E1(c,t)-  l  c*(1-cik*Y(t-k)+(1-c)n*KiCc,t-nl 

k»o 

The  only  difference  between  the  two  exponential  averages  in  Table  10-25 
is  in  the  last  term  of  the  equation.  When  we  assumed  that  El ( . 2, 0 )-l 10 , 
the  last  term  of  our  equation  became  (1-.2)n*( 110)-(.8)n*( 110).  When  we 
assumed  that  E1(  .2,0)-103,  the  last  term  of  our  equation  became  (1-.2)n 
*( 102)«( .8)n*103.  Therefore,  the  difference  between  the  two  exponential 
averages  should  be  ( .8n*110)-( •8n*103)-.8n*( 110-103)-. a"*?. 


Table  10-26 


t 

(1-c)n-  ( .8)n 

1 

(.8)1  -.800 

5.60 

2 

(.8)2-.  640 

4.48 

3 

( .8)3-. 512 

3.58 

4 

(.8)4-. 410 

2.87 

5 

(.S)5  -.328 

2.30 
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A  comparison  between  .8n*7  and  the  Difference  column  in  Table  10-25 
shows  the  numbers  to  be  almost  equal.  In  fact,  the  numbers  should  be  exactly 
equal,  but  there  is  some  error  in  the  calculations  caused  by  rounding.  The 
important  point  is  that  the  difference  between  the  two  exponential  averages 
is  declining  at  an  exponential  rate,  i.e.,  (1-c)n.  By  the  time  you  get  to 
time  period  t*10 ,  the  difference  between  these  two  exponential  averages 
will  be  .810*7«0.75. 

The  lesson  to  be  learned  is  that  if  you  are  going  to  calculate  single 
exponential  averages  you  should  have  an  adequate  number  of  data  points  to 
continue  your  calculations  out  far  enough  where  the  effect  of  your  assumed 
value  for  El(c.t-n)  on  the  value  of  your  exponential  average  is  negligible. 
Since  E1(c,t-n)  is  weighted  by  the  coefficient  (1-c)n,  the  smaller  your 
smoothing  constant  the  greater  the  weight  assigned  to  E1(c,t-n)  and  the 
more  observations  you  need  to  reduce  (1-c)n  to  a  small  number.  Table  10-27 
gives  you  the  smallest  number  of  observations  required  to  reduce  the  weight 
assigned  to  E1(c,t-n)  to  a  value  less  than  3  for  different  values  of  g. 


Table  10-27 


Smallest  values  for  n  such  that  (l-c)*1  <  8 


X 

.20 

.15 

.10 

.05 

.01 

.001 

.1 

16 

18 

22 

29 

44 

66 

mam 

8 

9 

11 

11 

21 

31 

5 

6 

7 

7 

13 

20 

KjH 

4 

4 

5 

5 

9 

14 

ni 

3 

3 

4 

4 

7 

10 

As  a  general  rule,  you  would  want  the  weight  assigned  to  your  assumed 
value  for  E1(c,t-n)  to  be  .01  or  less.  Anytime  the  weight  assigned  to 
E1(c,t-n)  becomes  greater  than  .10  you  have  a  potential  source  of  distor¬ 
tion  that  could  introduce  significant  error  in  your  forecast.  If  you  do 
not  have  a  sufficient  number  of  observations  you  can  still  calculate  expo¬ 
nential  averages.  However,  you  should  recognize  in  your  analysis  that  your 
results  may  be  distorted  by  the  combined  effects  of  your  assumed  value  for 
E1(c,t-n)  and  your  limited  data. 

Our  single  exponential  average  is  very  much  like  a  single  moving 
average.  The  primary  difference  is  in  the  weighting  scheme.  Consequently, 
we  would  expect  to  find  that  the  relationship  between  a  single  exponential 
forecast  and  a  single  exponential  average  parallels  the  relationship  bet¬ 
ween  a  single  moving  average  forecast  and  a  single  moving  average. 

Equation  10-7  describes  the  latter  relationship. 

FM1(n,t,t+i)  -  Ml ( t,t+i)  (Equation  io-7) 

Our  single  exponential  forecast  is  similarly  determined  by  our  single  expo¬ 
nential  average. 

FEI(c,t,t+i)-E1(c,t)  (Equation  10-16) 


In  words,  our  single  exponential  forecast  made  in  time  period  t  for  time 
period  t+i  is  the  single  exponential  average  made  in  time  period  t. 


For  an  example,  let  us  reexamine  the  data  set  used  earlier  to  calculate 
single  moving  averages.  The  data,  presented  in  Table  10-14  is  reproduced 
in  Table  10-28. 

Table  10-28 


Earlier  we  analyzed  the  above  data  for  trend  and  found  that  at  the 
95%  level  of  confidence  we  could  accept  the  hypothesis  that  RS«0,  that 
there  is  no  trend.  Therefore,  let  us  begin  our  analysis  by  assuming  that 

E1(c,o)=Y(t)-£27=82.7 

TTT 

For  our  smoothing  constant  let  us  set  c  »  .25. 

Then  El (.25, 1 )-( .25)*( 74)+( .75 )*( 82 . 7)=80 . 525 
and  E1< .25,2)«( .25)*(94)+( .75)*(80.525)«83.894 
and  E1( .25,3)-( .25)*(84)+( .75)*(83.894)-83.921 
and  E1(.25,4)-(.25)*(71)+(.75)*(83.921)-80.691 

For  practice,  you  calculate  El (.25, 5)  for  t*5  through  t-10.  Your 
results  should  look  like  Table  10-29. 

Table  10-29 


t 

Y(t) 

E1(.25,t) 

FBI ( .25 ,t,t+2) 

EFE1 : . 25»t,t+2) 

i 

74 

80.525 

- 

- 

1 

2 

94 

83.894 

- 

• 

3 

84 

83.921 

80.525 

3.475 

4 

71 

80.671 

83.894 

12.894 

5 

88 

82.518 

83.921 

4.079 

6 

81 

82.139 

80.691 

0.039 

m 

7 

77 

80.854 

82.518 

-5.518 

8 

91 

83.391 

82.139 

8.861 

9 

87 

84.293 

80.854 

6. 146 

10 

80 

83.220 

83.391 

-3.391 

i 
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We  measured  the  accuracy  of  our  single  moving  average  forecast  by 
looking  at  the  last  six  observations  and  calculating  the  MAD  and  the  stan¬ 
dard  error  of  the  forecast.  Since  our  early  single  exponential  averages 
are  somewhat  distorted  by  our  assumed  value  for  El (.25,0),  and  to  make  our 
measures  of  forecasting  error  compatible,  let  us  also  calculate  the  MAD  and 
the  standard  error  of  the  forecast  for  FE1( .25,t,t+2)  based  on  observations 
t=5  through  t=10. 


Forecast 

Statistics  FMI ( 3 , 5 , 5+2 )  FE1 ( 25 , t, t+2 ) 

MAD  4.833  4.717 

Standard  Error  5.6421  5.400 


Our  single  exponential  forecast  appears  to  be  slightly  more  accurate 
than  our  single  moving  average  forecast.  However,  the  differences  are  so 
small  as  to  be  almost  insignificant. 

Our  discussion  of  single  exponential  averages  began  by  describing  a 
single  exponential  average  as  similar  to  a  single  moving  average  using 
exponential  weights.  As  a  consequence  of  their  similarities,  you  might 
suspect  that  the  two  averages  would  exhibit  similar  characteristics.  One 
characteristic  of  a  single  moving  average  is  that  it  produces  biased 
forecasts  when  the  data  contains  an  increasing  or  decreasing  trend.  This 
same  bias  is  also  present  in  forecasts  based  upon  single  exponential  avera¬ 
ges.  If  there  is  an  upward  trend  in  the  data,  your  forecasts  will  be  con¬ 
sistently  too  low  and  if  there  is  a  downward  trend  in  the  data,  your 
forecasts  will  be  consistently  too  high. 

You  were  able  to  solve  the  problem  of  forecasting  with  moving  averages 
when  there  is  a  linear  trend  in  the  data  by  incorporating  double  moving 
averages  into  the  calculations  of  your  forecast.  Your  forecast  was  based 
upon  the  parameters  a(M2)  and  b(M2),  which  in  turn  were  dependent  upon  the 
values  Ml(n,t)  and  M2(n,t).  However,  the  equations  for  a(m2)  and  b(M2)  are 
dependent  upon  the  fact  that  the  observations  are  equally  weighted. 

Because  exponential  weighting  represents  a  systematic  scheme  of 
weighting  with  known  and  consistent  mathematical  properties,  equations  have 
also  been  developed  that  will  allow  us  to  use  double  exponential  averages 
as  a  basis  for  a  forecast  when  there  is  a  linear  trend  in  the  data. 

Before  you  cam  calculate  your  forecast,  however,  you  must  first  calcu¬ 
late  the  double  exponential  average.  In  the  discussion  of  single  exponen¬ 
tial  averages,  the  statement  was  made  that  the  equations  for  all  exponential 


averages  are  equivalent  to  the  two-observation  weighted  average  equation 
MW-W( 1 )*Y( 1 )+W( 2)*Y(2) .  Bv  making  several  substitutions  in  the  above 
equation.  Equation .  10-13  for  a  single  exponential  average  was  derived. 

El (c,t)*c*Y( t)+( 1-c)*E1 (c,t-1 )  (Equation  10-13) 

A  double  exponential  average  is  also  a  weighted  average.  The  weights 
are  still  c  and  1-c.  However,  a  double  exponential  average  is  a  weighted 
average  of  the  single  exponential  average  in  time  period  t  and  the  double 
exponential  average  in  the  previous  time  period. 

E2(c,t)*c#E1 (c,t)+( 1-c)*E2(c,t-1 )  (Equation  10-17) 

If  you  were  to  go  through  the  same  exercise  expanding  Equation  10-17  tli^t 
you  did  with  Equation  10-13,  you  would  find  that  Equations  10-14  and  10-15  have 
parallel  counterparts  for  double  exponential  averages. 

For  a  single  exponential  average.  Equation  10-14  says 

co 

E1(c,t)  l  c*(1-c)k*Y(t-k)  (Equation  10-14) 

k-o 

The  parallel  equation  for  a  double  exponential  average  is 
00 

E2( c, t )  I  c*( 1-c)k*E1(c,t-k>  (Equation  10-18) 

k-o 

Therefore,  a  double  exponential  average  is  an  exponentially  weighted 
sum  of  single  exponential  averages  in  the  sane  way  that  a  single  moving 
average  is  an  exponentially  weight  sum  of  the  observations.  Because  you 
never  have  an  infinite  data  set,  you  were  forced  to  modify  Equation  10-14  for 
a  finite  data  set  of  n  observations  to  read 
n-1 

E1(c,t)-  I  c*( 1-c)k *Y(t-k)+(1-c)n*E1(c,t-n)  (Equation  10-15) 

k-o 

Likewise,  to  account  for  the  realism  of  finite  data  sets.  Equation  10-18  must 
be  modified  to  read 
n-1 

E2(c,t)«  I  c*(  1-c)k*E1(c,t-k)  +  (1-c)n*E2(c,t-n)  (Equation  10-19) 

k-o 

At  this  point  you  have  the  same  problem  you  had  with  single  exponential 
averages.  Your  data  set  consists  of  observations  Y ( 1 )  through  Y(n). 

However,  to  calculate  E2(c,t-n)  you  need  to  know  E1(c,t-n)  and  E2(c,t-n-1), 
and  you  know  neither  one.  What  sure  you  going  to  do?  When  discussing 
single  exponential  averages  you  were  forced  to  assume  a  value  for  El(c,t- 
n),  and,  since  there  was  no  trend  in  the  data,  you  chose  to  let  E1(c,t- 
n)-Y(t).  Now  you  must  also  assume  a  value  for  E2(c,t-n)  if  you  are  going 
to  forecast  with  double  moving  averages.  What  value  should  we  assume?  To 
answer  this  question,  you  must  first  discuss  the  double  exponential  fore¬ 
cast. 


The  double  exponential  forecast,  like  the  double  moving  average  fore¬ 
cast,  is  used  to  forecast  a  time  series  variable  when  there  is  a  linear 
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trend.  Therefore,  our  forecast  equation  should  look  like  the  equation  for 
a  straight  line,  and  it  does. 


1  *1 


FE2(c,t,t+i)=a(E2)+b(E2)*i 


(Equation  10-20) 


Like  the  double  moving  average  forecast,  the  parameters  for  our  double 
exponential  forecast  a(E2)  and  b(E2).  In  fact,  the  equations  are  very 
similar.  For  our  double  exponential  forecast,  the  equations  are: 


a(E2)=2*E1(c,t)-E2(c,t) 
b(E2)=*  *|e1  (c,t)-E2(c,t)] 


(Equation  10-21) 
(Equation  10-22) 


We  finished  our  discussion  of  double  exponential  averages  with  the 
question  of  what  values  to  assume  for  E1(c,t-n)  and  E2(c,t-n)  left 
unanswered.  Now  that  we  have  the  equation  for  a(E2)  and  b(E2)  upon  which 
we  will  base  our  double  exponential  forecast,  we  can  use  these  equations  as 
a  basis  for  developing  our  initial  assumed  values  for  our  exponential 
averages . 

Ordinarily,  you  will  calculate  El(c,t)  and  E2(c,t)  first,  and  then 
calculate  a(E2)  and  b(E2).  However,  to  begin  our  exponential  smoothing 
process  we  cure  going  to  reverse  the  two  steps.  We  are  going  to  calculate 
estimates  of  a(E2)  and  b(E2)  for  time  period  t-n  and  then  use  these  estima¬ 
tes  to  find  our  assumed  values  for  El(c,t-n)  and  E2(c,t-n).  Where  are  we 
going  to  get  our  estimates  for  a(E2)  and  b(E2)  in  time  period  t-n?  Answer, 
from  a  LSBF  line. 

An  example  would  best  describe  the  process.  Suppose  we  were  analyzing 
the  data  set  in  Table  10-30. 

Table  10-30 


If  you  were  to  fit  a  LSBF  line  through  this  data  you  would  come  up  with 
the  formula  Y(t)*25+2*t.  From  our  LSBF  line  you  extract  the  two  parame¬ 
ters,  a-25  and  b*2.  These  two  parameters  become  your  estimate  for  a(E2) 
and  b(E2)  for  time  period  t-n.  Substituting  into  Equations  10-21  and  10-22. 

25-2*E1(c,t-n)-E2(c,t-n) 


* 

l-c 


El (c,t-n) -E2(c,t-n) 
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All  that  remains  is  to  select  our  smoothing  constant  and  solve  for  B1(c,t- 
n)  and  E2(C/t-n).  For  the  sake  of  this  example,  let  us  set  c«.2.  Based  on 
our  LSBF  parameters  a-25  and  b~2  we  can  calculate  our  assumed  values  for 
E2(c,t-n)  and  E2(c,t-n)  which  are  17  and  9  respectively.  We  can  now  begin 
to  calculate  our  exponential  averages. 


Table  10-31 


t 

Y(t) 

El ( .2 ,t ) 

E2( .2,t) 

0 

27 

17* 

9* 

1 

30 

19.00 

11.00 

2 

31 

21.2 

13.04 

3 

32 

23.16 

15.06 

4 

34 

24.93 

17.03 

5 

37 

26.74 

18.97 

6 

40 

28.79 

20.93 

7 

41 

31.03 

22.95 

8 

43 

33.02 

24.96 

9 

35.02 

26.97 

♦Assumed  values 

based  on  < 

aur  LSBF 

line 

Given 

a  value  for  E1(.2,t)  and  E2( 

.2,5)  you  would 

l  calculate  a(E2)  and 

b(E2) 

for  each  time  period.  You  would  then  base 

your  linear  forecast  on 

these 

two  parameters.  In 

our  example,  suppose  you  were  forecasting  3 

perl  o 

:.s  into  the 

future . 

Your  forecast  and  your 

forecasting  error  would 

lo_  : 

like  Table 

10-32. 

Table  10-32 

t 

Y(t) 

a(E2) 

b(E2) 

FE2 ( • 2 , t , t+3 )  EFE2: .2,t,t+3) 

1 

27 

27.00 

2.00 

. 

2 

30 

29.36 

2.04 

- 

- 

3 

31 

31.26 

2.03 

- 

- 

4 

32 

32.83 

1.98 

33.00 

-1.00 

5 

34 

34.51 

1.94 

35.48 

-1.48 

6 

37 

36.65 

1.97 

37.45 

-  .35 

7 

40 

39.11 

2.02 

38.77 

1.23 

8 

41 

41.08 

2.01 

40.33 

.67 

9 

43 

43.07 

2.01 

43.56 

.44 

The  values  of  b(E2)  in  Table  10-32  reflect  a  basic  characteristic  of  both 
double  moving  average  forecasts  and  double  exponential  forecasts.  The 
value  of  b{E2)  will  fluctuate  around  the  true  slope  of  the  trend  line. 
However,  if  the  slope  of  the  trend  line  changes,  b(E2)  possesses  the  abi¬ 
lity  to  adjust  to  the  change. 

To  impress  this  point  upon  you,  let  us  look  at  a  set  of  data  that  has  a 
change  in  trend.  Table  10-33  consists  of  nine  observations.  The  first 
observation  is  from  the  trend  line  Y{t)=100+5*t.  Observation  three  through 
nine  are  from  the  trend  line  Y(t)=70+20*t.  Observation  two  is  the  inter¬ 
section  of  the  two  trend  lines.  For  our  initial  estimates  set  a(E2)=a=100 
and  b(E2)“b“5  from  the  first  trend  line.  From  these  estimates  assumed 
values  of  E1(c,o)  and  E2(c,o)  can  be  calculated.  For  a  smoothing  constant 
c=2/7,  E1(c,o)  and  E2(c,0)  equal  87.5  and  75  respectively.  Table  10-33 
demonstrates  how  the  slope  of  a  double  exponential  average  adjusts  for  the 
change  in  trend. 

Table  10-33 


t 

Y(t) 

El  (2/7 ,t) 

E2 ( 2/7 , t ) 

b(E2 ) 

Ab(E2) 

0 

l 

87.500 

75.000 

5 

_ 

1 

105 

92.500 

80.000 

5 

0 

2 

110 

97.500 

85.000 

5 

0 

3 

130 

106.786 

91.225 

6.224 

1.224 

4 

150 

119.113 

99.199 

7.974 

1.750 

5 

170 

133.666 

109.047 

9.848 

1.874 

6 

190 

149.761 

120.680 

11.632 

1.784 

7 

210 

166.972 

133.906 

13.226 

1.594 

8 

230 

184.980 

148.499 

14.529 

1.366 

9 

250 

203.557 

164.230 

15.731 

1.139 

10 

270 

222.541 

180.890 

16.660 

.929 

11 

290 

241.815 

198.297 

17.407 

.747 

12 

310 

261.296 

216.297 

18.000 

.593 

Several  characteristics  of  b(E2)  can  be  detected  by  examination  of  Table 
10-33.  First,  it  is  evident  that  b(E2)  is  responding  to  the  change  in  the 
trend  line.  In  the  ten  time  periods  from  time  period  2  to  time  period  12, 
b(E2)  has  more  than  tripled  from  5  to  18.  Second,  b(E2)  responded  imme¬ 
diately  to  the  change  in  trend.  Time  period  3  is  the  first  time  period  not 
on  the  old  trend  line,  and  b(E2)  increased  from  5  to  6.224.  Take  note, 
however,  that  b(E2)  would  have  increased  even  if  Y(3)  had  been  solely  a 
random  fluctuation  and  not  the  beginning  of  a  new  trend.  Third,  b(E2) 
increased  at  an  increasing  rate  for  the  first  three  observations  after  the 
change  in  trend.  In  the  ten  time  periods  from  time  period  2  to  time  period 
12,  b(E2)  increased  by  260%.  However,  half  that  increase  was  realized  by 
time  period  6,  4  time  periods  after  the  change  in  trend.  Last,  after  time 
period  5,  b(E2)  begins  to  increase  at  a  decreasing  rate.  In  fact,  this 
trend  will  continue  as  b(E2)  approaches  the  slope  of  the  new  trend  line 
asymptotically.  b(E2)  will  never  equal  20  because  the  observations  from 
the  old  trend  line  will  continue  to  affect  the  calculations  of  b(E2)  no 
matter  how  small  their  exponential  weights  become.  If  you  were  to  graph 


i 

r 


0  123456789  10  11  12  t 

Our  forecast  for  any  future  time  period  depends  upon  the  value  of  a(E2) 
and  b(E2).  a(E2)  and  b(E2),  in  turn,  change  as  we  move  from  time  period  to 

time  period.  Therefore,  we  would  expect  that  our  forecast  of  a  future  time 
period  will  also  change  as  we  move  through  time.  For  example,  let  us  com¬ 
pare  our  forecast  of  time  period  t*9  as  we  move  from  time  period  t»1  to 
t=*8.  To  do  this  let  us  use  our  data  from  Table  10-34. 

Table  10-34 


t 

a(E2) 

b(E2 ) 

FE2( 2/7,1 

tr9) 

E(FE2:2/7 ,t,9) 

1 

27.00 

2.00 

43.00 

0 

2 

29.36 

2.04 

43.64 

-.64 

3 

31.26 

2.03 

43.44 

-.44 

4 

32.83 

1.98 

42.73 

.27 

5 

34.51 

1.94 

42.37 

.73 

6 

36.65 

1.97 

42.56 

.44 

7 

39.11 

2.02 

43.15 

-.15 

8 

41.08 

2.01 

43.09 

-.09 

Since  we 

know  that 

Y(9) 

■43 ,  we  can  calculate 

the 

error  in  our  forecast  as 

we  move 

through  time. 

Mote  that 

even  though 

our 

error  was  at  a  minimum  in 

time  period  8,  our 

forecast  did 

not  necessarily 

improve  as  we  moved  forward 

in  time. 

Our  forecast 

was  more 

accurate  in  time 

period  2  than  it  was  in 

time  period  5,  and 

more 

•  accurate 

in  time  period 

4  than  either  time  period 

i 


« 
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5  or  time  period  6.  It  is  important  to  recognize  that  while  your  most 
recent  forecast  may  be  your  "best"  forecast  in  terms  of  the  available  data, 
it  may  not  be  your  most  accurate  forecast. 


We  began  our  discussion  of  single  exponential  averages  by  showing  that 
El(t,c)  was  nothing  more  than  a  weighted  average. 

E1(c,t)=*c*Y(t)  +  (  1-c)*E1(c,t-1 )  (Equation  10-13) 

The  sum  of  the  weights  cure  equal  to  c+(l-c)“l.  Then  we  expanded  our 
equation  for  E1(t,c)  into  an  infinite  exponentially  weighted  sum  of  past 
observations . 

00 

El( t,c)“  l  c*(1-c)k*Y(t-k)  (Equation  10-14) 

k=o 

Does  the  sum  of  our  exponential  weights  also  add  to  one?  The  answer  is 
yes,  provided  that  our  smoothing  constant,  c,  is  less  than  or  equal  to  one 
and  greater  than  or  equal  to  zero. 

00 

l  c*(1-c)k  -1  for  0  <  c  <  1  (Equation  10-23) 

k-o 

Thus  we  can  see  that  our  single  exponential  average  is  equivalent  to  a 
weighted  average  where  the  weights  are  exponential,  they  sum  to  one,  and 
the  number  of  observations  in  the  weighted  moving  average  is  infinite.  We 
then  modified  our  equation  to  account  for  finite  data  sets. 
n-1 

E1(t,c)«  £  c*( 1-c)k*Y(c-k)+( 1-c)n*E1(c,t-n)  (Equation  10-15) 

k=*o 

Do  our  weights  still  add  up  to  one?  The  answer,  mathematically  speaking, 
is  no. 

I  c*(1-c)k  =»  1-(1-c)n+1 
k=o 

However,  you  must  remember  that  theoretically  the  term  E1(c,t-n)  is  itself 
a  summation  of  exponentially  weighted  observations.  The  weights  c*(1-c) 

for  k»n+1  to  infinity  are  all  incorporated  in  the  terra  Ei(c,t-n).  And 
00 

y  «c*( i-c) *■( i-c) n+^ 
k=n+l 

Therefore,  theoretically,  all  of  the  weights  are  present  in  Equation  10-15 
and  they  still  sum  to  one  provided  that  our  smoothing  constant,  c,  is  less 
than  or  equal  to  one  and  greater  than  or  equal  to  zero. 


The  question  now  is,  given  the  restriction  that  6  <  c  <  1,  how  do  we  select 
a  particular  value  for  our  smoothing  constant.  The  component  of  our  expo¬ 
nential  weight  that  declines  as  you  move  backward  in  time  is  (1-c)^.  This 
term  declines  slowly  when  c  is  very  small,  and  quickly  when  c  is  very  large. 
Table  10-35  demonstrates  how  (1-c)*  behaves  for  three  different  values  of  c, 
.95,  .5,  and  .05. 


Table  10-35 


k 

( 1-.95) 

( 1-.5) 

( 1-.05 

1 

.0500 

.500 

.95 

2 

.0025 

.250 

.903 

3 

.125*10"# 

.125 

.857 

4 

•625*10"S 

.625*10“! 

.815 

5 

.313*10"6 

.313*10“! 

.774 

6 

.  156*10”? 

.156*10“! 

.735 

7 

.800*10"9 

.781*10“2 

.698 

8 

.391*10~10 

.391*10“2 

.663 

9 

.195*10“* ! 

.195*10“2 

.630 

There  is  one  important  comparison  that  can  be  made  examining  Table  IQ- 
35.  First,  a  small  c  discounts  the  past  at  a  very  slow  rate.  Older  obser¬ 
vations  can  still  have  a  relatively  significant  effect  on  the  exponential 
average,  and  consequently,  on  the  forecast.  Compared  to  moving  averages,  a 
small  c  corresponds  in  its  effects  to  a  large  n  for  a  moving  average. 
Conversely,  a  large  c  discounts  the  past  at  a  very  rapid  rate.  Older 
observations  have  a  negligible  impact  on  the  exponential  average,  and  con¬ 
sequently  on  the  forecast.  A  large  c  corresponds  in  its  effects  to  a  small 
n  for  a  moving  average. 

Therefore,  your  considerations  in  Chapter  3  regarding  the  selection  of 
n  for  a  moving  average  forecast  also  apply  to  the  problem  of  selecting  c 
for  an  exponential  average  forecast.  If  you  select  a  luge  c,  your  fore¬ 
cast  will  be  very  responsive  to  changes  in  trend,  but  also  subject  to 
distortion  by  random  fluctuations  in  the  data.  If  you  select  a  small  c 
your  forecast  will  be  relatively  unaffected  by  random  fluctuations  in  the 
data,  but  it  will  also  be  unresponsive  to  changes  in  trend.  It  is  the  same 
tradeoff  that  we  have  been  confronting  throughout  this  chapter. 

The  value  you  select  for  c  will  depend  on  the  same  three  factors  that 
influenced  your  selection  of  n  for  a  moving  average:  (a)  your  expectation 
that  the  future  will  bring  a  change  in  trend,  (b)  the  cost  or  loss  to  you 
if  you  fail  to  forecast  a  change  in  trend,  and  (c)  your  ability  to  tolerate 
instability  in  your  forecasts.  The  greater  your  expectations  that  the 
future  will  bring  a  change  in  trend,  the  greater  your  risk  of  loss  if  you 
fail  to  respond  to  a  change  in  trend,  and  the  greater  your  tolerance  for 
instability  in  your  forecast,  the  larger  the  smoothing  constant  you  can 
select. 


A  large  c,  however,  makes  your  forecast  sensitive  to  random  fluc¬ 
tuations  in  the  data.  The  further  out  into  the  future  you  are  trying  to 
forecast,  the  greater  the  magnitude  of  the  potential  distortion  that  can 
occur  because  of  random  fluctuations.  Therefore,  you  must  place  limits  on 
the  maximum  value  you  select  for  c  based  upon  the  number  of  time  periods 
into  the  future  you  are  trying  to  forecast.  A  general  guide  is  to  restrict 
your  chosen  value  for  c  to  the  range  0  to  Table  10-36  lists  represen¬ 

tative  maximum  values  for  smoothing  constants  based  on  this  rule. 


i 


Table  10-36 


2 

0  <  c  <  i+1 


1 

2 

3 

4 

5 

6 

7 

8 
9 

10 


1.000 
.670 
.500 
.400 
.330 
.286 
.250 
.222 
.200 
.  182 


If  you  select  a  value  for  your  smoothing  constant  that  is  larger  than  this 
rule  would  indicate,  be  aware  that  your  forecasts  may  be  distorted  by 
the  random  fluctuations  present  in  your  most  recent  observations. 
Furthermore ,  the  further  out  into  the  future  you  forecase,  the  greater  the 
effect  this  distortion  will  be. 
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CHAPTER  11 


COST  ESTIMATION 


INTRODUCTION 


Cost  estimating  or  cost  prediction  encompasses  a  wide  variety  of  tasks 
and  techniques  which,  individually,  may  be  quite  simple,  but  taken  as  a 
whole,  become  quite  complex.  For  example,  estimating  the  costs  for  a  total 
force  structure  is  more  involved  than  individual  system  costing  because  the 
former  must  take  into  account  the  interactions  of  all  the  individual  sys¬ 
tems  which  make  up  the  total  force. 

In  the  acquisition  process,  costs  are  normally  categorized  as: 

Research  and  Development  costs:  All  costs  associated  with 
bringing  a  system  into  production  configuration.  These  costs  include  con¬ 
cept  studies,  engineering  design,  program  management,  advanced  development, 
and  may  even  include  costs  for  construction  of  prototypes. 

Investment  costs:  All  costs  associated  with  phasing  a  new  system 
into  the  operational  inventory.  These  include  manufacturing  or  production, 
construction  of  facilities,  initial  spares  and  support  equipment,  technical 
data,  initial  training,  and  other  initial  logistic  support  costs. 

Operating  and  Support  costs:  All  costs  associated  with  operating 
and  maintaining  a  system  after  it  has  been  phased  into  the  operational 
inventory.  Whereas  Research,  Development  and  Investment  costs  are  "one¬ 
time"  expenditures.  Operating  and  Support  costs  are  recurring  throughout 
the  system's  life  cycle. 

Costs  may  also  be  viewed  as  incremental  when  they  account  for  the 
amount  of  additional  resources  to  be  consumed  after  the  new  system  is  in 
place,  including  consideration  for  any  old  system  phase-outs,  manpower 
realignment,  and  resource  reallocation.  In  essence,  incremental  costing 
ignores  sunk  costs  and  is  concerned  only  with  changes  in  cost  of  a  new 
system  over  and  above  costs  already  being  incurred  on  the  old  system. 

AFSCM  173-1,  Cost  Estimating  Procedures,  describes  the  cost  estimating 
process  as  one  which  involves  five  steps. 

1.  Define  and  plan  the  task. 

2.  Select  the  estimating  structure. 

3.  Collect,  evaluate,  and  adjust  the  necessary  cost  and  cost  related 
data. 

4.  Select  and  apply  the  estimating  method. 

5.  Document  the  analysis  and  results. 


Although  each  of  the  steps  warrants  special  consideration,  the  empha¬ 
sis  here  will  be  directed  toward  steps  3  and  4.  It  is  important  to  note 
that  even  though  data  collection  and  selection  of  estimating  methodology 
are  listed  as  separate  tasks,  to  do  one  without  full  consideration  of  the 
other  would  result  at  best  in  a  haphazard  effort.  For  example,  it  would  be 
just  as  inappropriate  to  formulate  a  cost  model  without  consideration  of 
available  (or  useable)  data  as  it  would  be  to  manufacture  an  airframe 
without  considering  the  engine(s)  to  propel  it, 

COSTING  METHODS 

Except  when  used  to  develop  budget  estimates  which  later  become 
ceilings  on  expenditures,  most  costing  methodologies  are  directed  toward 
reducing  uncertainty,  i.e.,  to  determine  affordability  or  to  aid  in  the 
decision  between  competing  alternatives.  Several  of  the  most  commonly  used 
methods  are  given  below. 

Industrial  Engineering.  The  industrial  engineering  method  for  esti¬ 
mating  the  cost  of  a  system  or  component  consists  of  aggregating  part  by 
part  estimates  to  get  total  system  cost  including  the  cost  of  assembly. 

This  method  requires  that  the  actual  system  final  configuration  be  known. 
Obviously,  the  industrial  engineering  approach  cannot  be  used  for  cost  esti 
mating  in  the  conceptual  and  early  development  phases  where  engineering  de¬ 
signs  are  in  a  state  of  flux.  Furthermore,  because  of  the  large  number  of 
separate  estimates  to  be  aggregated,  this  approach  is  often  cumbersome  and 
costly  to  perform.  And  finally,  even  when  system  (or  component)  configura¬ 
tion  is  known,  it  is  often  very  difficult  to  obtain  the  necessary  cost  data 

Analogy.  Analogy  cost  estimating  can  be  used  for  costing  equipment 
items  when  technical  specifications  are  available  and  the  specifications 
and  costs  are  known  for  a  comparable  item.  Ordinarily,  analogy  cost  esti¬ 
mating  is  used  when: 

1 .  A  quick  study  is  needed ; 

2.  Estimates  are  needed  for  low-cost  system  components;  or, 

3.  Where  it  is  not  practical  to  secure  a  large  sample  of  past  costs 
and  technical  data. 

To  apply  this  method  the  analyst  must  first  identify  a  suitable  "old" 
counterpart  for  the  new  item.  Then  an  appropriate  technical  characteristic 
must  be  chosen,  since  this  approach  assumes  that  cost  is  proportional  to 
the  magnitude  of  some  technical  characteristic.  The  following  example 
illustrates  how  the  weight  of  am  item  may  be  used  to  estimate  the  cost  of  a 

new  component .  „ 

"n 

Cn  "  Co  x  w^ 

where  C  n  -  estimated  cost  of  new  component 

C  0  »  actual  cost  of  a  comparable  component 
W  n  *  weight  of  the  new  component 
W  0  ■  weight  of  the  old  component 


Although  its  usefulness  should  not  be  ignored,  analogy  cost  estimating 
has  a  serious  limitation.  It  uses  only  one  observation  as  a  basis  for  the 
estimate  as  well  as  only  one  explanatory  variable,  which  is  weight  in  the 
previous  example.  If  unusual  circumstances  are  associated  with  either  the 
new  or  old  component,  the  resulting  estimate  may  be  severely  biased. 

The  major  advantage  of  analogy  costing  is  that  it  is  quick  and  easy  to 
apply.  In  addition,  it  is  often  used  to  obtain  "rough"  initial  estimates 
or  as  a  check  of  the  reasonableness  of  other  more  complex  methods  of  costing. 

Cost  Estimating  Relationships.  One  of  the  most  useful  cost  estimating 
techniques  is  the  cost  estimating  relationship  ( CER ) .  The  CER  is  developed 
from  historical  data  from  a  similar  system  or  component  to  predict  costs 
for  the  new  system.  CERs  normally  relate  cost  as  the  dependent  variable  to 
performance  parameters  or  other  system  characteristics,  the  independent 
variables,  in  the  form  of  a  multiple  regression  model.  See  Chapter  13  for  a 
discussion  of  regression  analysis.  This  technqiue  of  relating  system  para¬ 
meters  to  cost  is  commonly  referred  to  as  parametric  costing. 

Separate  CERs  are  usually  developed  for  each  of  the  cost  categories 
which  comprise  the  major  system.  The  CERs  are  then  summed  to  obtain  the 
total  system  cost  estimate.  For  example,  the  following  CER  was  developed 
to  estimate  the  avionics  cost  per  unit  for  a  fighter  aircraft: 


C  -  3.296  +  .05185X!  +  .0305X2  *  .00555X3  +  .0213X4  -  .003704X5 

where  C  ■  Production  cost  of  the  150th  unit  in  millions  of  1975  dollars 

X^  thru  X5  -  Complexity  factors  derived  through  other  CERs  involv¬ 
ing  characteristics  of  the  aircraft  performance  and  design,  such  as  gross 
take-off  weight,  cruising  speed,  number  of  crew  members,  thrust,  etc. 

Although  the  example  above  is  a  linear  function  and  lends  itself  very 
well  to  least  ~iquares  regression  techniques,  many  CERs  can  and  have  been 
developed  using  nonlinear  curve  fitting  techniques  as  in  the  following 
example  estimating  avionics  production  costs  for  a  cargo  aircraft. 


C  «  (1.7926  x  lO-^X^.MO  •  x2l-340  ,  x3-960 

where  C  *  Production  cost  of  the  150th  unit  in  millions  of  1975  dollars. 
Xj  *  First  flight  year 


X2  ■  Total  quantity  of  avionics  subsystems  per  aircraft 


X3  -  Length  of  aircraft  fuselage 


However,  an  equation  of  this  form  can  be  transformed  into  a  linear 
equation  by: 

log  C  -  log (1.7926  x  10“9)  +  2.54  log  ^  +  1.34  log  X2  +  .96  log  X3 

The  manager  should  pay  particular  attention  to  the  fact  that  any  CER 
provides  a  single,  point  estimate  of  cost.  This  is  am  average  cost,  some¬ 
times  referred  to  as  a  "single  best  guess".  It  is  usually  much  more  bene¬ 
ficial  to  provide  the  user  with  a  range,  or  bounded  estimate  of  costs  in 
the  form  of  a  prediction  interval.  A  prediction  interval  enables  the  user 
to  make  statistical  statements  about  the  costs  estimated  by  the  CER.  For 
exaimple,  one  might  say  that:  "I  am  95  percent  confident  that  the  cost  of 
the  new  engine  will  be  between  $409,000  amd  $621,000",  as  opposed  to  using 
the  point  estimate  of  $515,000.  This  becomes  extremely  important,  if  not 
critical,  when  the  CER  itself  indicates  a  weak  relationship  among  the 
dependent  and  independent  variables.  After  all  CERs  and  corresponding  pre¬ 
diction  intervals  have  been  developed,  the  cost  estimated  by  each  are  then 
summed  to  find  the  total  system  cost.  Although  the  point  estimates  can  be 
totalled  directly,  prediction  intervals  cannot.  The  Rand  Corporation  has 
developed  a  technique  for  constructing  prediction  intervals  for  these 
summed  totals.  The  application  of  the  technique  is  dependent  upon  the 
variance  (amount  of  error)  which  exists  in  the  individual  CERs.  A  complete 
discussion  of  this  technique  can  be  found  in  the  Rand  Corporation  memoran¬ 
dum  RM-5806-PR,  October  1968. 

Cost  Analysis.  Useful  applications  of  the  techniques  presented  in  this 
chapter  are  found  in  the  various  methods  of  cost  analysis.  Indeed,  most 
forms  of  cost  analysis  involve  cost  estimates  of  some  kind.  The  objective 
of  cost  analysis  is  to  use  cost  as  an  input  in  the  decision  making  process. 

In  this  regard,  cost  estimates  add  vital  information  and  help  to  reduce  the 
uncertainty  surrounding  the  choice  among  alternatives.  Cost  analysis  has 
assumed  an  increasingly  more  important  role  in  the  current  era  of  tight 
defense  budgets,  rising  costs,  and  scarce  resources.  In  the  Department  of 
the  Air  Force,  the  responsibility  for  cost  analysis  falls  under  the  Direc¬ 
torate  of  Cost  and  Management  Analysis,  Comptroller  of  the  Air  Force.  Actual 
analyses,  however,  are  accomplished  at  all  levels  of  Air  Force  operations. 

For  whatever  purpose,  to  support  a  budget  estimate,  a  base  closing,  or  a 
contract  for  janitorial  services,  cost  analysis  provides  a  valuable  input 
in  the  decision  making  process .  Depending  upon  its  intended  purpose ,  a 
cost  analysis  may  take  on  one  of  the  specific  forms  described  below. 

Benefit-Cost  Analysis.  One  of  the  techniques  of  cost  analysis  which 
has  been  used  by  the  Army  Corps  of  Engineers  is  benefit-cost  analysis.  Its 
primary  purpose  is  to  show  a  relationship  between  the  costs  incurred  in  a 
specific  project  and  the  benefits  to  be  derived  from  that  project.  This 
technique  has  frequently  been  used  to  analyze  (and  sometimes  justify) 

Federal  water  resources  projects  such  as  dams.  In  a  benefit-cost  analysis, 
costs  which  will  be  incurred  (C)  are  combined  with  the  benefits  to  be 
achieved  (B)  over  a  specified  period  of  time  in  the  following  ratio: 

B 

r 
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When  the  ratio  exceeds  1,  obviously,  benefits  exceed  costs;  which 
would  indicate  the  project  may  be  worthwhile.  When  two  or  more  alter¬ 
natives  are  being  considered,  the  project  with  the  highest  benefit-cost 
ratio  would  prevail,  in  the  absence  of  other  decision  criteria.  In  addi¬ 
tion,  the  benefit-cost  ratio  could  be  used  to  rank  projects  which  cure  not 
mutually  exclusive.  This  would  give  the  decision  maker  an  indication  of 
which  projects  would  yield  the  greatest  benefits  in  relation  to  their  costs. 
This  technique,  however,  is  not  without  serious  drawbacks.  In  the  first 
place,  it  does  not  consider  the  magnitude  of  the  costs/benefits  involved. 

If  the  decision  maker  ranks  all  of  the  projects  by  B/C,  the  very  first  pro¬ 
ject  may  consume  the  entire  budget,  leaving  nothing  for  other  necessary 
programs.  Secondly,  significant  difficulties  arise  when  the  analyst 
attempts  to  estimate  downstream  benefits,  particularly  long-term  benefits. 
Despite  its  shortcomings,  however,  benefit-cost  analysis  has  found  some 
successful  applications.  Its  use  has  been  extended  to  such  diverse  areas 
as  medical  research  and  urban  transportation  problems. 

Cost  Effectiveness  Analysis.  This  type  of  cost  analysis  is  similar  to 
the  method  discussed  above,  except  that  a  measure  of  effectiveness  is  used 
as  the  benefits  to  be  attained.  For  example,  in  comparing  alternate  weapon 
systems  under  consideration,  the  ratio  of  system  availability  versus  life 
cycle  cost  could  be  formed  as: 


Availability 

LCC 

Other  possibilities  include:  Mean-tipe-between-fa^.-'  re  Reliability 

Life  cycle  costs  LCC 

In  each  case,  the  highest  effectiveness/cost  ratio  might  be  basis 
which  could  be  used  by  the  decision  maker  to  aid  in  the  decision  between 
alternatives.  This  technique  is  particularly  useful  when  the  life  cycle 
costs  of  competing  systems  are  roughly  similar. 

Break-even  Analysis.  This  technique  of  cost  analysis  is  especially 
useful  in  resolving  the  problem  of  equipment  replacement.  In  break-even 
analysis,  the  analyst  computes  the  cost  stream  of  keeping  the  old  system  on 
hand  and  compares  it  with  the  cost  profile  of  the  proposed  system  as 
illustrated  in  the  following  chart. 


Here,  System  A  is  the  old  system  which  will  incur  an  increasing  amount 
of  operation  and  support  costs  as  the  system  continues  to  age  and  wear  out. 
The  investment  costs  for  System  B,  the  proposed  replacement,  cause  the 
total  costs  for  System  B  to  be  higher  initially.  After  year  t,  the  break¬ 
even  point,  total  system  cost  for  A  exceeds  that  of  B.  If  the  useful  life 
of  both  systems  is  greater  than  t,  then  System  B  should  be  purchased  now. 

Although  the  example  is  over-simplified,  the  technique  has  been  found 
to  be  valuable  in  replace  versus  modify  decisions.  Its  utility,  however, 
diminishes  when  high  technology  items  with  relatively  short  useful  lives 
are  considered. 


Cost  Sensitivity  Analysis.  One  of  the  primary  functions  of  cost  estimating 
and  cost  analysis  is  to  provide  information  to  the  user  concerning  the 
sensitivity  to  total  costs  to  changes  in  assumptions  or  input  parameters. 

For  example,  a  sensitivity  analysis  could  be  performed  to  determine  how 
a  change  in  hours-between-removal  of  an  engine  affects  the  total  mainte¬ 
nance  costs  for  the  engine.  If  the  analysis  shows  that  there  is  minimal 
effect,  efforts  to  reduce  engine  maintenance  costs  can  be  directed  else¬ 
where. 

Since  cost  models  are  developed  and  used  extensively  in  cost  estimating, 
sensitivity  analysis  is  performed  by  varying  inputs  to  the  model  and 
examining  the  resulting  cost  estimate  (output)  for  significant  changes. 
Similarly,  the  assumptions  which  were  made  in  model  development  should  also 
be  tested  for  cost  sensitivity.  For  example,  if  am  operating  and  support 
cost  (0  and  S)  model  for  am  aircraft  is  based  upon  the  assumption  that 
arnnual  flying  hours  will  be  1,100  hours,  then  sensitivity  amalysis  should 
be  performed  to  determine  the  inpact  on  0  and  S  costs  from  chamges  in  the 
annual  flying  hour  program. 

SPECIAL  CONSIDERATIONS 

Although  the  cost  estimating  technique  used  will  depend  largely  upon 
the  purpose  for  which  the  estimate  is  being  developed,  there  are  certain 
criteria  which  should  be  followed  during  any  cost  estimating  process. 

Data.  Since  the  value  of  the  cost  estimate  depends  upon  the  accuracy  of 
the  cost  data  used,  a  great  deal  of  care  should  be  taken  to  insure  that  the 
data  source (s)  is/are  reliable.  Whenever  possible,  two  sources  should  be 
used,  one  to  verify  the  ocher. 

Tracking.  Cost  estimates  are  often  used  to  forecast  or  predict  costs  which 
will  be  incurred  during  some  future  interval  of  time.  When  this  is  the  case, 
the  estimator  should  compare  the  estimate  with  the  actual  costs  when  they 
occur.  This  procedure,  called  tracking,  provides  valuable  information.  If 
significant  variances  are  detected  between  actual  and  estimated  costs,  the 
cause  of  the  variance  should  be  investigated.  Was  the  estimate  erroneous 
because  of  the  estimating  technique,  data,  or  assumptions?  The  answers  to 
the  question  can  serve  as  valuable  inputs  to  future  cost  estimates  and  will 
help  to  Improve  the  process. 


Present  Value.  Whenever  a  cost  estimate  is  required  as  an  input  to  the 
selection  of  a  system  or  project,  the  costs  (and  savings)  must  be  dis¬ 
counted  at  a  rate  of  10  percent,  except: 

1.  Water  resources  projects, 

2.  Commercial  services  obtained  under  AFR  26-12,  or 

3.  When  the  project  or  system  has  an  expected  useful  life  less  than 
three  years. 

Methodology  for  applying  the  present  value  criterion  is  contained  in 
Chapter  21. 

Inflation.  When  deriving  cost  estimates,  costs  should  be  presented  and 
evaluated  with  constant  and  current  (inflated)  dollars.  Specific  indices 
for  this  purpose  cam  be  found  in  AFP  173-1"!,  USAF  Cost  and  Planning  Factors, 
Vol  I.  When  estimating  future  costs,  dollar's  should  first  be  inflated 
through  the  use  of  an  approved  index,  and  then  discounted  at  the  appropriate 
rate. 

CONCLUSION 


The  techniques  and  methodologies  described  in  this  chapter  merely 
scratch  the  surface  on  the  use  of  costs  in  reducing  uncertainties 
surrounding  the  decision  process.  The  choice  of  the  appropriate  technique 
depends  ultimately  upon  the  objective  of  the  analysis  and  the  data 
available  as  inputs. 
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CHAPTER  12 


LIFE  CYCLE  COST 


IMTBODOCTION 


The  rapidly  increasing  costs  of  modem  defense  systems  and  even  store 
rapidly  increasing  costs  to  operate  and  support  them  has  been  a  major 
concern  of  Department  of  Defense  (DOD)  managers.  Recognizing  that  down¬ 
stream  operating  and  support  costs  often  are  several  times  greater  than 
the  initial  acquisition  cost  of  a  system/  DOD  managers  have  introduced 
the  concept  of  life  cycle  cost  into  the  systems  development  and  acquisi¬ 
tion  process. 

AFR  800-11  defines  life  cycle  cost  (LCC)  as  the  total  cost  of  an  item 
or  system  over  its  full  life,  including  development,  acquisition,  opera¬ 
tion  and  support,  and  disposal.  In  essence,  LCC  represents  the  total 
cost  of  ownership  of  a  system  or  component  from  "cradle  to  grave." 

With  the  introduction  of  LCC,  cost  has  been  elevated  as  a  major 
decision  criteria,  equal  in  importance  with  performance  and  schedule. 

Cost  in  this  sense,  of  course,  means  life  cycle  cost.  And  in  order  to 
reduce  total  life  cycle  cost,  the  impact  of  various  systems  designs  and 
configurations  on  operating  and  support  costs  must  be  carefully  evaluated 
early  in  the  system's  design  phases. 

Figure  12-1  illustrates  the  relationships  of  the  major  categories  of 
LCC  to  the  phases  of  the  systems  life  cycle.  After  Milestone  O,  the 
decision  to  proceed  into  the  succeeding  phase  of  system  development  must 
be  supported  by  a  thorough  analysis  of  the  LCC  impact  of  alternate  designs. 
The  objective  is  to  insure  that  development  continues  not  only  with  a  sys¬ 
tem  which  is  needed,  but  one  which  can  be  afforded. 

LCC  APPROACHES  TO  SYSTEM  ACQUISITION 

Life  cycle  cost  can  be  viewed  as  a  three-sided  approach  to  system 
cost  management  with  the  single  purpose  of  acquiring  systems  which  meet 
minimum  operational  requirements  for  the  lowest  overall  cost. 

In  the  first  place,  LCC  is  a  procurement  technique.  Competing  systems 
are  evaluated  on  their  total  cost  over  their  useful  lives,  rather  than 
solely  on  their  acquisition  costs. 

Secondly,  LCC  is  an  estimating  technique.  Comparisons  are  made  among 
existing  systems  and  those  which  have  been  proposed.  In  this  respect, 
the  costly  lessons  learned  in  the  past  play  an  important  role  in  the 
selection  and  development  of  new  systems.  LCC  also  encourages  early 
trade-off  studies  to  determine  the  impact  on  total  costs  for  increased 
performance,  reliability,  and  maintainability. 
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And  thirdly,  LCC  is  used  to  develop  design  targets.  Cost  goals  are 
established  to  control  operating  and  support  costs  as  well  as  acquisition 
costs.  The  tern  design-to-cost,  formerly  associated  with  acquisition 
costs  only,  has  taken  on  added  meaning  so  that  now  it  is  more  accurately 
termed  design- to-LCC.  Design- to- LCC  encourages  design  trades  so  that  the 
overall  cost  impact  of  alternative  system/component  designs  can  be 
determined.  In  viewing  total  system  cost,  it  could  be  beneficial  to 
incur  an  increase  in  acquisition  costs  if  the  net  result  is  lower  down¬ 
stream  operating  and  support  costs  and  ultimately,  life  cycle  cost. 

The  general  approach  stipulated  when  developing  new  items/sy stems  is 
to  use  the  design-to- life-cycle-cost  approach,  commonly  with  a  design  to 
unit  production  cost  goal  and  with  contractual  riders  concerning  owner¬ 
ship  costs.  This  approach  underlines  the  idea  that  cost  has  become  a 
parameter  equal  in  importance  with  performance  and  schedule.  This 
requires  techniques  to  estimate  all  LCC  categories  during  the  design 
phase,  with  sensitivity  to  design  changes.  This  is  not  easy  or  straight¬ 
forward  and  cannot  be  done  with  absolute  accuracy,  but  techniques  exist 
which  are  believed  to  give  reasonable  accuracy,  or  "figures  of  merit" 
which  can  be  used  in  the  various  decisions  required,  such  as  design 
trades,  source  selection,  and  structuring  contractual  incentives. 

All  standard  cost  estimating  methods  and  techniques  are  used  in  life 
cycle  costing.  Considerable  use  is  made  of  the  analogy  and  parametric 
methods  since  they  can  be  structured  to  use  data  available  in  the  design 
phase.  (See  Chapter  11).  The  regression  technique  is  widely  wed  in 
estimating  LCC  (See  Chapter  13). 

LCC  Categories 

A  typical  life  cycle  cost  breakdown  structure  is  shown  in  Figure  12-2. 
All  costs  associated  with  the  system(s)  under  consideration  are  classified 
under  the  three  major  cost  categories:  Research  and  Development,  Invest¬ 
ment,  and  Operations  and  Support.  Such  a  categorization  of  costs  can 
serve  several  purposes: 

1.  Data  collection  -  cost  data  can  be  accumulated  into  specific 
"accounts"  which  have  been  identified  as  being  the  most  useful. 

2.  Comparison  of  alternatives  -  when  costs  for  competing  systems  are 
collected  under  standard  formats,  comparing  these  systems  on  a  cost  basis 
becomes  an  easier  and  more  meaningful  task. 

3.  Model  building  -  with  standard  classifications  of  costs,  models 
can  be  constructed  from  a  readily  available  data  base. 

4.  Trade-off  analysis  -  a  cost  breakdown  structure  facilitates  oost 
trade-offs  among  conpeting  systems  as  well  as  identification  of  major 
cost  drivers. 


For  a  typical  (but  not  all)  weapon  system,  about  15  percent  of  LCC 
ia  spent  on  development,  about  25  percent  on  production  and  facilities, 
and  60  percent  for  initial  logistics  support  and  operating  and  support 
costs. 

It  is  both  interesting  and  noteworthy  that  this  rough  breakout  of 
costs  is  shared  not  only  by  the  commercial  aircraft  industry,  as  might  be 
expected,  but  also  by  automobiles,  televisions,  and  refrigerators. 

The  cost  breakout  in  Figure  12-2  should  in  no  way  be  construed  to  be 
the  optimum  categorization  of  LCC.  It  is  important  for  the  analyst  to 
categorize  costs  in  ways  which  will  be  meaningful  to  the  decision  maker. 
However,  when  LCC  is  used  to  compare  alternative  systems  or  designs,  it 
is  important  to  use  the  same  basis  for  comparison.  This  implies  that  the 
same  cost  categories  should  be  used  in  evaluating  each  system. 

RELEVANT  VERSUS  INCREMENTAL  COSTS 


It  should  also  be  noted  that  the  use  of  LCC  in  choosing  among  alter¬ 
natives,  only  the  relevant  and  incremental  costs  should  be  considered. 
Relevant  costs  are  defined  as  expected  future  costs  which  differ  under 
various  alternatives. 

To  illustrate,  suppose  a  cost  analyst  is  attempting  to  find  the  oost 
impact  of  various  maintenance  concepts,  i.e.,  three-level  vs  two-level, 
or  in-house  vs  contractor  operated,  etc.  Since  air  crew  training  costs 
are  not  affected  by  the  choice  of  maintenance  concept,  these  costs  are 
not  relevant  and  should  not  be  considered  in  this  particular  analysis. 

Similarly,  incremental  costs  are  defined  as  additional  costs  required 
to  achieve  a  stated  capability  or  level  of  activity,  incremental  costing 
is  very  useful  in  analyzing  replacement  systems.  In  such  cases,  only  the 
costs  which  differ  from  the  old  system  need  to  be  considered  when  evalu¬ 
ating  the  replacement  system.  Positive  incremental  costs  are  those 
required  costs  which  exceed  the  costs  already  being  incurred  with  the 
current  system.  On  the  other  hand,  cost  savings  to  be  achieved  with  the 
replacement  system  can  be  viewed  as  negative  incremental  costs. 

In  the  final  analysis,  the  use  of  the  concepts  of  relevant  and 
incremental  costs  forces  the  decision  maker  to  focus  attention  on  the 
most  important  factors  affecting  the  costs  of  alternatives. 

LCC  Models 


Life  cycle  costing  often  involves  extensive  use  of  models.  A  LCC 
model  is  merely  a  representation  of  the  significant  cost  relationships  of 
the  item/system  under  consideration.  Models  have  been  developed  to  pro¬ 
vide  estimates  of  total  cost  and  estimates  of  design-related  cost. 


There  are  literally  hundreds  of  life  cycle  cost  models  which  exist 
today.  Each  has  been  constructed  for  a  specific  purpose/program.  To  be 
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useful,  however,  each  model  should  exhibit  the  following  characteris¬ 
tics  : 

1.  Completeness :  The  model  should  include  all  the  elements  of 
cost  which  are  appropriate  for  the  issue  being  considered.  If  two 
alternatives  are  under  evaluation  but  replenishment  spares  cost  the  same 
for  each  system,  then  the  model  need  not  consider  replenishment  spares. 

2.  Validity :  The  model  should  represent  reality.  The  analyst  must 
verify  the  input  data  to  insure  that  the  model  output  represents  the 
real  world  environment.  Without  model  validity,  the  user  would  not  be 
able  to  trust  the  output.  In  the  validation  process,  the  analyst  must 
pay  particular  attention  to  logic  and  consistency. 

3.  Sensitivity :  Each  model  should  be  sensitive  to  changes  in 
parameters  or  design.  It  should  be  able  to  show  the  differences  in 
costs  which  would  occur  when  system  configurations  are  altered.  Each 
model  should  also  be  able  to  identify  the  major  cost  drivers  of  a  sys¬ 
tem  or  subsystem.  Management  emphasis  can  then  be  concentrated  on  those 
parameters/items  which  are  responsible  for  the  greatest  costs. 

4.  Availability  of  input  data;  Accurate  input  data  must  be  avail¬ 
able  not  only  to  constrict  the  model,  but  also  for  meaningful  applica¬ 
tion  of  the  model.  Data  availability  has  generally  not  been  a  problem. 

The  problem  lies  in  the  accuracy  of  the  data  and  the  format  in  which  it 
is  collected.  Both  cause  the  analyst  extreme  difficulties.  A  problem 
also  exists  in  the  proliferation  of  data.  For  example,  one  O&S  cost 
model  used  more  than  200  data  sources  from  both  government  and  industry 
sources. 

TYPES  OF  LCC  MODELS 

Life  cycle  cost  models  have  been  classified  into  categories  which 
describe  how  the  models  are  used  or  how  they  are  formulated.  The  most 
common  classifications  are  enumerated  below.  These  classes  are  not 
mutually  exclusive,  i.e. ,  a  given  model  may  be  included  under  two  or 
more  categories.  \ 

There  is  a  common  misconception  that  a  life  cycle  cost  model  con¬ 
tains  the  totality  of  life  cycle  costs  for  a  system/component.  This  is 
generally  not  true.  The  vast  majority  of  LCC  models  consider  only  a 
portion  of  total  LCC.  This  is  in  keeping  with  the  established  concepts 
that  only  relevant  and  incremental  costs  should  be  considered.  At  the 
same  time,  however,  there  is  occasionally  a  need  to  determine  or  calculate 
the  total  LCC  for  a  system.  Several  models  are  available  for  accom¬ 
plishing  this  task. 

In  general,  each  LCC  model  can  be  classified  under  one  or  more  of 
the  following  categories: 

1.  Accounting  models.  Accounting  models  are  cash  register-like 
models  in  which  the  components  of  cost  are  aggregated  through  sets  of 


aquations.  Operating  and  support  cost  models  are  usually  found  in  this 
category  which  accumulates  costs  in  "accounts ,"  such  as  depot  mainten¬ 
ance,  training,  initial  spares,  etc.  The  logistic  Support  Cost  (LSC) 
model  is  basically  an  accounting  model.  The  LSC  model  aggregates  costs 
for  spares,  fuel  consumption,  personnel  training,  data,  etc.  The  model 
also  shows  the  percent  of  LCC  encompassed  by  each  cost  category. 

2.  Cost  factor  models.  These  models  estimate  each  cost  element 
using  a  factor  derived  from  Air  Force  experience  on  similar  systems. 

A  large  collection  of  cost  factors  is  maintained  on  a  current  basis  in 
AST.  173-13,  PSAF  Cost  and  Planning  Factors.  Among  the  thousands  of 
factors  contained  in  AFP  173-13  are  the  standard  pay  rates  for  all  AF 
grades,  fuel  and  oil  consumption  and  costs  per  flying  hour  for  each 
M/D/S  aircraft,  annual  and  per  flying  hour  costs  for  depot  maintenance, 
etc.  Cost  factor  models  have  been  used  by  the  Air  Force  in  estimating 
missile  and  aircraft  squadron  annual  operating  and  support  costs. 

Examples  are  the  the  Cost  Analysis  Cost  Estimating  (CAGE)  model  and  the 
Planning,  Programming,  Budgeting  Annual  Cost  Estimating  (BACE)  model. 

Both  models  cam  be  found  in  AFP  173-13. 

A  typical  cost-factor  equation  contained  in  the  BACE  model  for 
estimating  depot  maintenance  cost  is: 

Cost  ■  (UE)  •  [ (UE  Factor)  +  (FH  Factor  •  FH) ] 

where, 

UE  -  number  of  aircraft  (unit  equipment) 

FH  »  number  of  flying  hours 

3.  Cost  estimating  relationship  models.  CER  models  are  a  form  of 
parametric  cost  estimating  which  relates  historical  cost  data  to  design, 
performance,  or  environmental  parameters.  CER  models  are  statistically 
derived  through  regression  analysis  and  have  the  advantage  of  using 
relatively  few  data  inputs  to  estimate  a  large  portion  of  total  costs. 

To  illustrate,  a  major  aerospace  corporation  formerly  was  using 
15,000  lines  of  data  to  define  the  maintenance  work  load  for  a  missile 
system.  The  company  found  that  through  the  use  of  CERs  only  108  lines 
of  data  were  needed  to  define  91  percent  of  the  work  load. 

The  Modular  Life  Cycle  Cost  Model  (MLCCM)  for  Advanced  Aircraft  is 
a  parametric  LCC  model  which  uses  a  luge  number  of  CERs  to  estimate  LCC. 
The  model  was  developed  for  the  Air  Force  Flight  Dynamics  Laboratory  to 
conduct  design/cost  performance  trade  studies  of  advanced  aircraft  sys¬ 
tems  during  conceptual  and  preliminary  design  phases.  MLCCM  has  been 
loaded  onto  the  AFLC  CREATE  computer  system  to  provide  maximum  flexibility 
for  users  of  the  model. 


Moat  CER  models  consist  of  many  individual  CER  equations,  each  esti¬ 
mating  a  different  component  of  LCC.  A  CER  model  for  an  aircraft  sys¬ 
tem  would  have  several  CERs  related  to  engine  costs.  For  example,  one 
CSR  could  relate  turbine  intake  temperature  and  thrust  to  engine  manu¬ 
facturing  costs.  Another  might  relate  hours  between  removals  to  depot 
maintenance  costs.  By  summing  the  individual  CERs,  an  estimate  of  total 
LCC  can  be  obtained  by  the  analyst.  And  since  the  CERs  are  statisti¬ 
cally  derived,  confidence  intervals  for  the  estimates  can  be  constructed. 

However,  a  problem  arises  when  constructing  a  confidence  interval 
for  the  total  LCC  which  has  been  found  by  susming  the  CERs.  this  prob¬ 
lem  can  be  resolved  by  using  a  special  technique  which  is  explained  in 
the  final  section  of  this  chapter. 

4.  Economic  analysis  models.  Economic  analysis  models  are  usually 
developed  to  assess  costs  versus  benefits  problems  or  to  aid  in  lease 
versus  buy  decisions.  These  models  enable  the  user  to  make  trade-offs 
between  investment  and  O&S  costs  and  consider  the  time  value  of  money 
through  the  use  of  present  value  discounting  techniques. 

The  Army  Corps  of  Engineers  has  frequently  used  economic  analysis 
models  to  assess  the  benefits/costs  of  waterway  projects  such  as  dams 
and  bridges.  Recent  developments  have  resulted  in  economic  analysis 
models  for  use  in  determining  the  value  of  research  projects.  A  model 
which  relates  research  costs  and  potential  benefits  can  be  very  useful 
in  deciding  whether  a  particular  research  project  should  be  undertaken. 

5.  Simulation  models.  Used  primarily  in  the  logistic  support  cost 
area,  simulation  models  are  well  suited  in  analyzing  the  cost  sensi¬ 
tivity  relative  to  major  program  changes,  such  as  flying  hours, 
maintenance  concept,  or  operational  scenario.  The  Logistics  Composite 
(LOOM)  model  is  an  example  of  a  simulation  model  widely  used  by  the 
Tactical  Air  Command  to  assess  the  costs  of  various  deployment  and  con¬ 
tingency  plans. 

6.  Other  models.  A  number  of  other  LCC  models  have  been  developed 
to  estimate  costs  associated  with  warranties,  reliability  improvement, 
manpower,  inventory  management,  and  level  of  repair  (maintenance  concept) 

USES  OF  LCC  MODELS 

LCC  models  have  been  established  to  accomplish  a  wide  variety  of 
cost  estimating  tasks.  Many  of  these  tasks  have  already  been  described 
in  Chapter  11,  Coat  Estimation.  Although  most  models  have  been  created 
for  a  specific  purpose,  such  as  to  estimate  production  costs  for  a  par¬ 
ticular  aircraft  system,  others  can  be  used  to  estimate  costs  for  a 
variety  of  aircraft  systems.  The  Modular  Life  Cycle  Cost  Model  for 
Advanced  Aircraft  falls  into  the  latter  category.  The  user  should 
beware  of  the  purpose  for  which  a  particular  model  was  created.  Each 
model  cannot  and  should  not  be  used  indiscriminately.  In  other  words, 
the  peculiar  requirements  for  each  LCC  estimating  problem  should  be 
cue f Tilly  assessed  to  ensure  that  the  model  selected  is  appropriate . 


The  list  of  uses  and  applications  of  LCC  models  is  a  Ion?  one.  In 
addition  to  estimating  all  or  a  portion  of  total  system  LCC,  LCC  models 
have  found  wide  application  in  break-even  and  sensitivity  analysis,  as 
well  as  cost  effectiveness  and  benefit-cost  analysis  mentioned  in  a 
previous  chapter. 

LCC  models  can  be  used  to  assess  the  value  of  proposed  engineering 
changes.  Other  models  can  and  have  been  used  to  select  an  appropriate 
maintenance  concept  for  a  new  system. 

An  important  use  of  LCC  models  has  been  to  establish  cost  goals,  not 
only  for  acquisition  costs,  but  OSS  costs  as  well.  A  common  technique 
is  to  establish  an  OSS  cost  target  and  reward/penalize  the  successful 
bidder  when  measured  OSS  costs  fall  below/above  the  targeted  costs. 

This  technique  was  used  in  a  limited  sense  in  the  acquisition  of  the 
P-16. 

In  the  final  analysis,  the  concept  of  LCC  is  rapidly  becoming  a  way 
of  life  within  the  DM).  Cost,  in  particular  life  cycle  cost,  is  and 
will  continue  to  be  a  predominant  factor  in  the  acquisition  and  mainten¬ 
ance  of  the  force  structure  of  the  U.S.  -military  establishment. 

PREDICTION  INTERVALS 

In  a  previous  section,  it  was  noted  that  certain  problems  arise 
when  an  attempt  is  made  to  sum  individual  CERs.  When  CERs  are  developed 
statistically,  confidence  intervals  (called  prediction  intervals)  can  be 
confuted  for  each  CER.  Although  a  total  cost  estimate  can  be  found  by 
summing  the  costs  estimated  by  each  CER,  a  problem  arises  in  the  case 
where  a  prediction  interval  is  desired  for  the  total  cost  estimate.  A 
solution  to  this  problem  was  presented  by  J.  A.  Die  Rossi.*  The  pro¬ 
posed  solution  encompasses  two  separate  cases.  The  first  is  for  sunned 
totals  with  equal  variances,  i.e. ,  the  standard  error  for  each  CER  is 
equal.  The  second  case  arises  when  the  standard  errors  are  not  equal. 

In  studying  the  solution  methodology,  it  is  readily  apparent  that  the 
solutions  closely  resemble  Cases  3  and  4  of  the  t-tests  presented  in 
Chapter  8  on  hypothesis  testing. 

Users  should  refer  to  RM-5806  PR  for  the  theoretical  basis  of  the 
solution  and  its  application.  For  direct  application,  a  solution  model 
is  available  on  the  AFLC  CREATE  computer  system. 


*J.  A.  Die  Rossi,  "Prediction  Intervals  for  Sunned  Totals,"  Rand 
Memorandum  RM-5806-PR  (Santa  Monica  CA:  The  Rand  Corporation,  October 


CHAPTER  13 


REGRESSION  ANALYSIS 


Regression  analysis  is  a  technique  for  forecasting  the  value  of  some 
dependent  variable  based  on  the  known  or  easily  predicted  value  of  at  least 
one  other  variable  (independent  variable  or  variables)  and  nuking  an 
assessment  of  the  range  of  uncertainty  in  the  value  of  the  dependent  vari¬ 
able. 

Regression  analysis  is  useless  if  the  independent  variable  is  as  hard 
to  predict  as  the  dependent  variable .  A  way  to  predict  snow  removal  cost 
based  on  the  amount  of  snowfall  could  be  accurate  to  the  penny  and  still 
not  helpful-- it  is  as  hard  to  predict  the  amount  of  snowfall  in  advance  as 
it  is  to  predict  costs. 

There  are  three  skills  involved  in  regression  analysis: 

a)  The  skill  of  determining  what  easily  predictable  (or  known) 
independent  variable  or  variables  cause  the  dependent  variable  to  vary. 

b)  The  skill  of  determining  the  relationship  between  the  depen¬ 
dent  and  independent  variables. 

c)  The  skill  of  determining  whether  or  not  the  relationship  (or 
relationships)  found  are  meaningful. 

The  technique  should  be: 

1.  Decide  on  the  cause  and  effect  relation. 

2.  Gather  the  data,  each  observation  or  case  to  contain  a  value 
for  each  variable. 

3.  Graph  the  data  ae  a  preliminary  test  of  the  hypothesis  stated 
in  step  one. 

4.  Fit  the  equation. 

5.  Evaluate  the  results. 

It  is  essential  that  these  steps  be  followed.  It  is  unforgivable  to 
collect  a  set  of  data  and  then  find  what  "fits  best.” 

The  cause  and  effect  relation  comes  from  knowledge,  there  is  no  way  to 
detect  it  from  the  data.  This  implies  one  must  be  familiar  with  the  item 
under  consideration.  If  the  analyst  lacks  this  personal  knowledge,  expert 
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assistance  must  be  sought.  Engineers  can  sometimes  suggest  relationships. 
There  are  people  at  the  APPRO  (if  one  exists)  that  may  know.  If  there  is  a 
SPO  connected  with  the  items,  someone  there  may  be  able  to  suggest  cause 
and  effect  relations.  Contractor  personnel  may  be  willing  to  provide 
information  but  the  analyst  must  remember  contractor  personnel  are  not 
disinterested  and  their  interest  is  not  necessarily  the  same  as  that  of  the 
Air  Force.  Gathering  the  data  may  be  very  difficult  indeed.  Records  get 
lost,  or  thrown  away,  data  is  not  kept  with  analysis  in  mind.  Events  occur 
over  time  and  time  brings  changes. 

Unless  the  independent  variable  is  time,  one  must  make  the  data  com¬ 
parable  .  If  costs  are  collected  in  dollars,  it  probably  will  be  necessary 
to  use  some  method  of  correcting  for  time,  converting  the  "current  year" 
dollars  to  "constant  dollars"  with  the  use  of  an  appropriate  index  series. 

If  the  independent  variable  is  time,  regression  does  not  apply;  use  a  time 
series  analysis. 

In  many  cases  it  is  necessary  to  correct  for  cost-quantity  effects 
(see  the  Learning  Curve  chapter) .  Graphing  the  data  is  easy  if  there  is 
only  one  explanatory  (independent)  variable,  use  regular  graph  paper  or, 
if  nonlinearity  is  obvious,  use  semi-log  paper  or  log- log  paper.  Each  type 
of  paper  gives  a  characteristic  formula  (equation)  if  the  data  appears  to  be 
a  straight  line. 

Fitting  the  equation  is  a  simple  matter  if  you  have  a  computer  available 

-  At  AFLC  bases  the  pricing  function  has  a  computer  system — 

Copper  Impact — available,  several  stored  programs  are  available.  The 
people  at  the  pricing  shop  will  probably  help  you  use  it. 

-  There  axe  many  other  computer  systems  but  access  may  be  difficult- 
check  with  the  Data  Automation  people  on  your  base. 

-  For  two  variable  problems  some  electronic  calculators — HP65, 

HP38E,  TI58,  TI59  or  equivalent  are  either  wired  to,  or  can  be  programmed 
to  fit  a  regression  line. 

Evaluation  of  the  results  is  not  easy.  'ftere  are  sets  of  statistics 
that  can  be  used  for  evaluation.  These  statistical  tests  assume  there  is 
no  relation  between  the  independent  variable  or  variables  and  the  dependent 
variable  (The  null  hypothesis  or  Ho).  Say  you  are  only  willing  to  take  a 
5%  chance  of  being  wrong.  Then  certain  statistics  are  calculated,  if  the 
calculated  statistic  is  greater  than  some  critical  value  (determined  by  the 
chance  you  are  willing  to  take)  we  say  that  could  have  happened  only  five 
times  in  one  hundred  (probability  .05)  if  there  was  no  relationship.  But 
since  it  did  happen,  we  reject  the  null  hypothesis.  Notice  that  five  per¬ 
cent  of  the  time  we  expect  to  be  wrong,  i.e.,  there  will  be  no  relationship 
among  the  variables  but  by  chance  we  get  a  statistic  greater  them  critical. 

An  example  will  show  what  we  mean. 
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The  following  problem  was  extracted  from  a  paper  by  Mr.  Aubrey  A.  Yawitz 
(1) .  Mr.  Yawitz  does  am  excellent  analysis  but  the  only  part  we  have  copied 
is  the  problem  and  his  cause  and  effect.  The  following  analysis  is  all  our 
own  given  the  cause  and  effect. 

The  problem  is  to  devise  a  cost  estimating  relationship  to  estimate 
labor  hours  in  overhaul,  the  heavier  the  locomotive  the  more  labor  hours 
it  takes  to  overhaul. 

For  our  purposes  we  will  hypothesize  a  linear  relationship 
Labor  hours  =  a  +  b  (weight) 

Hq:  b  »  0 

HQ  is  the  null  hypothesis  that  will  be  tested  after  the  equation  has 
been  fitted. 

The  data  collected  from  a  single  depot  in  the  years  1977  and  1978  ere; 


Observation 

Locomotive 

Gross  Weight 
(tons) 

Unit 
Overhaul 
(labor  hours) 

1 

44 

2208.00 

2 

60 

2625.86 

3 

60 

2425.33 

4 

60 

2501.00 

5 

80 

2361.50 

6 

100 

3092.50 

7 

100 

2913.75 

8 

100 

3214.67 

9 

120 

3481.00 

10 

120 

3332.00 

11 

120 

3003.50 

The  first  step  is  to  plot  the  data  on  a  sheet  of  graph  paper  (see  figure 
13-1).  The  scales  will  be  arbitrary  but  should  have  two  characteristics : 

1.  Wide  enough  so  all  the  data  can  be  plotted. 

2.  Scaled  so  reading  is  moderately  easy. 

The  X  axis  is  gross  weight  in  tons,  and  the  scale  is  2  tons  to  the  1/10 
inch  or  20  tons  for  ONE  LINEAR  INCH  of  axis.  Figure  13-1  shows  a  plot  of 
this  data.  The  Y  axis  is  labor  hours,  the  scale  is  50  hours  to  the  1/10  inch 
or  500  HOURS  FOR  ONE  LINEAR  INCH  of  axis. 

Plotting  is  the  first,  and  rather  rough  test  of  the  hypothesis  of  a 
linear  relationship  between  weight  as  the  causative  variable  and  hours  used 
in  overhaul  as  the  effect. 
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Figure  13-1.  Data  on  Locomotives 


To  plot  the  first  observation  one  reads  weight  equals  44  tons,  hours 
equals  2208.  On  the  weight  axis  read  the  44  then  up  the  Y  axis  to  2208, 
make  a  mark  (I  have  used  an  X)  and  leave  a  number  to  identify  the  point. 

I  used  the  observation  number. 

The  rest  of  the  points  are  plotted  the  same  way. 

When  the  data  has  been  plotted  it  should  be  examined  for  outlines  (data 
points  well  outside  any  pattern) .  The  point  for  observation  five  appears 
to  be  such  a  point. 

Outliers  deserve  to  be  investigated— there  may  be  good  reasons— a 
special  type  of  unit,  one  unit  designed  to  work  in  a  special  environment, 
a  unit  made  under  a  causation  system  different  from  the  rest— are  examples . 
If  such  a  cause  is  found  it  is  reasonable  to  either  adjust  the  data  or 
exclude  that  data  point  from  the  study.  Sometimes  the  outlier  turns  out 
to  be  simply  a  typographical  error  which  can  be  corrected. 


There  is  no  statistical  test  that  should  enable  an  analyst  to  ignore  a 
data  point.  The  job  is  to  test  a  hypothesis.  If  you  exclude  data  that 
does  not  fit  your  hypothesis  even  before  you  do  a  formal  test,  there  is  no 
probability  of  rejecting  that  hypothesis. 

In  the  AFIT/LS  computer  system  there  is  a  computer  program  to  do 
regression.  Usually  the  data  is  set  up  in  a  file  in  computer  memory 
according  to  specific  computer  program  instructions  and  then  some  regression 
program  (there  are  several  of  them)  is  run  according  to  instructions. 

Nearly  all  computer  programs  have  the  same  output,  some  optional,  some 
standard.  For  this  problem  the  analyst  got  the  following  outputs: 

Equation:  Labor  hours  =  1620.904  +  13.827  (NT) 

Height  is  in  tons 

Equation  F  score  38.824 

Standard  error  of  the  b  *  2.21908 

t  for  b  -  6.231 

Arithmetic  Mew  of  Y  »  2832.646 

Standard  deviation  of  Y  ■  430.00 

Variation  in  Y  «  1849770.35 


90%  Confidence  Level  Prediction  Intervals 


x  value 

Lower  Bound 

Point  Estimate 

Upper  Bound 

0 

1102.407 

1620.904 

2139.401 

44 

1813.028 

2229.289 

2645.550 

60 

2057.575 

2450.520 

2843.465 

80 

2349.257 

2727.059 

3104.860 

100 

2623.731 

3003.598 

3383.464 

120 

2881.265 

3280.136 

3679.008 

160 

3355.293 

3833.214 

4311.134 

For  those  who  do  not  have  access  to  a  computer  the  method  of  calculation 
of  these  numbers  on  a  desk  calculator  is  shown  in  the  section  on  calculation 
at  the  end  of  this  chapter. 

Evaluation:  One  data  point  should  be  questioned;  observation  5.  But 
unless  clear  evidence  of  error  is  found,  it  should  be  kept  in  the  data  set. 
It  was  kept  in  the  data  set. 

Statistical  significance.  There  are  three  measures  of  statistical 
significance : 


1.  For  the  equations:  An  F  score  is  used.  Technically,  F  is  a 
relation  of  variances,  the  "explained  variance"1  and  the  "unexplained 
variance" 2. 

When  the  number  value  for  F  is  determined  it  is  compared  to  a 
standard  from  tables.  F  tables  are  divided  into  levels  of  significance , 
each  of  which  has  two  arguments.  One  less  than  the  number  of  parameters 
to  be  estimated  in  the  head  and  the  number  of  observations  minus  the  number 
of  parameters  in  the  stub.  In  our  case  the  arguments  are  2-1  or  1  in  the 
head  and  11-2  or  9.  in  the  stub. 

If  the  calculated  value  of  F  is  greater  than  the  table  value, 
one  considers  the  result  significant.  For  the  case  at  hand  we  chose  the 
95%  level  of  confidence  with  column  1,  row  9  and  got  5.12  from  the  table. 

This  means  that  if  there  is  no  relation  between  cost  and  weight  we  would 
expect  am  F  score  of  5. 12 3  or  larger  only  one  time  in  twenty  trials. 

Since  the  F  from  the  data  is  38.824  we  consider  our  results  significant  at 
the  95%  level  and  reject  the  hypothesis  of  no  relation  between  them. 

2.  The  significamce  of  the  b  of  the  equation.  In  two  variable 
problems  these  two  tests  are  equivalent,  in  larger  problems  they  are  not. 

In  this  case  we  use  a  calculated  "t"  statistic  (parauneter  b  divided  by  its 
own  standard  error)  in  our  came  6.231  (13.827/2.219).  This  score  is  com- 
pared  with  a  standard  "t"  table.  The  "t"  table  has  two  arguments ,  level  of 
confidence  in  the  head  amd  number  of  observations  less  number  of  estimated 
parameters  in  the  stub.  In  our  case  the  stub  value  is  11  minus  2  or  9  at 
the  95%  confidence  level. 

In  the  table  column  for  95%  confidence  level  row  9  we  find  2. 262 4 . 
The  "t"  for  our  problem  is  6.23.  Since  6.231  is  greater  than  2.262  we  reject 
the  hypothesis  that  b  »  0.  This  test  is  exactly  analogous  to  the  test 
for  the  equations,  if  b  were  indeed  0,  i.e.,  there  was  no  statistical 
relationship  between  cost  amd  weight,  only  5%  of  the  time  would  a  sample  of 
11  show  a  "t"  am  large  or  larger  than  2.262.  Remember,  we  expect  to  reject 
Hq  some  five  percent  of  the  time  when  it  is  true. 

3.  The  third  test  of  significamce  is  not  so  precise  as  the  others 

(it  is  the  test  of  H:  a  -  0.  There  are  very  precise  tests  for  this  hypothesis 

but  they  are  uncommon  in  canned  computer  programs.  This  much  cam  be  said 
if  the  prediction  interval  for  X  ■  0  does  not  include  T  ■  0  the  null 
hypothesis  can  not  be  rejected  at  the  specified  level  of  confidence.  In  our 
case  the  bounds  for  the  90%  prediction  interval  when  X  ■  0  are  1102.407  to 

2139.401.  This  interval  does  not  include  zero  hence  the  hypothesis  a  ■  0 

can  be  rejected  at  the  90%  level  of  confidence.  This  is  a  conservative  test 
of  significamce  but  whether  or  not  a  ■  0  is  not  really  very  importamt. 

Is  regression  worth  the  trouble?  We  know  we  could  use  univatriate 
statistics i  estimate  the  new  cost  as  the  average  of  the  old  costs  with  a 


10ften  called  "me am  regression  sum  of  the  squares" 

^ften  called  "mean  error  sum  of  squares" 

3Table  from  CRC  Stamdard  Math  Tables,  16th  Edition,  The  Chemicatl  Rubber 
Co.,  Cleveland,  OH,  1968,  p.  588. 

4 Ibid,  p.  583 
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statistical  probability.  If  you  use  only  the  Y  values  as  predictors  you 
will  always  predict  the  net  value  of  Y  as  the  mean  of  the  set  within  the 
interval. 


K  an  of  Y  +  appropriate  "t"  [standard  deviation  of  Y2]  +  standard 
deviation  of  Y/sample  size. 

Say  a  90%  prediction  interval  for  the  next  Y  would  be 

2832.65  £  1.812  /430.092  +  (430.09z/ll) 

-  2832.65  +  1.812  f 20179 3. 54 

-  2018.65  <  Y  <  3646.63 

This  says  we  expect  Y  to  be  between  2019  and  3647  but  5%  of  the  time  Y  will 

be  greater  than  3647  and  5%  of  the  time  Y  will  be  less  than  2019.  Our  "best 

guess  all  the  time  is  2832.65  or  2832.65  +  28%  of  2832.65. 

If  we  use  regression  and  the  same  confidence  level  the  "best  guess" 
depends  upon  the  value  of  X,  the  weight  of  the  locomotive  in  tons. 

Our  interval  estimates  would  be: 

for  a  44  ton  locomotive  we  say  2229.892  +  18.67%  of  2229.892 

for  a  60  ton  locomotive  we  say  2450.520  +  16.03%  of  2450.520 

for  a  80  ton  locomotive  we  say  2727.059  £  13.85%  of  2727.059 

for  a  100  ton  locomotive  we  say  3003.598  +  12.65%  of  3003.598 

for  a  120  ton  locomotive  we  say  3280.136  +_  12.16%  of  3280.136 

for  a  160  ton  locomotive  we  say  3833.214  +_ 12.47%  of  3833.214 

For  a  different  sample  or  data  set  the  numbers  would  all  be  different.  Our 
estimates  are  made  from  our  specific  example . 

We  have  considerably  improved  our  ability  to  predict  the  value  of  the 
manhours  needed  to  overhaul  a  locomotive  when  we  know  the  weight  of  that 
locomotive . 

Instead  of  a  single  estimate  the  estimate  depends  upon  the  weight  and 
none  of  the  intervals  are  as  wide  as  the  28%  we  would  have  with  only  costs 
instead  of  a  cost  estimating  relationship. 

One  good  question:  Are  those  estimates  good  enough?  The  answer  depends 
upon  the  needs.  If  the  answer  is  yes,  well  and  good.  If  no,  then  one 
must  consider  alternatives — are  there  other  characteristics  in  addition 
to  weight  which  are  independent  of  weight  and  might  cause  costs?  Is  it 
possible  to  extend  the  sample?  Are  there  other  locomotives  on  which 


overhaul  data  is  available?  Finally,  if  this  CER  is  inadequate,  what  do 
you  have  that  is  better? 

There  is  one  very  bad  habit,  try  several  variables  on  which  one  has 
data  to  find  which  gives  the  best  fit.  This  is  wrong. 

Remember  tests  of  significance!  If  there  is  no  relation  between 
variables  the  F  score  will  be  so  luge  some  specified  percent  of  the  time. 
Indiscriminant  fitting  is  wrong  because  consecutive  tests  destroy  probabil¬ 
ities.  If  one  tests  two  separate  models  the  probability  of  accepting  at 
least  one  (assuming  no  relations  among  them)  is  about  10%  instead  of  the 
5%,  or  if  you  are  using  90%  level  of  confidence  the  probability  of  accepting 
at  least  one  is  nearly  20%.  If  one  is  depending  upon  statistical  tests 
the  rules  of  probability  must  be  checked  carefully. 

SOLUTIONS  WITH  A  DESK  CALCULATOR 

To  calculate  the  numbers  given  in  the  example  problem,  construct  a 
worksheet.  Worksheets  tend  to  be  tedious  and  subject  to  errors.  Careful 
work  will  help  prevent  errors  but  a  check  column  is  recommended. 

Table  13-1  shows  the  worksheet}  the  steps  in  calculation  are: 

1.  Copy  the  data  into  columns  1,  2,  and  3 

2.  Sum  column  2,  get  the  sum  of  X  ■  964 

3.  Sum  column  3,  get  the  sum  of  Y  ■  31159.11 

4.  Fill  in  column  7,  K  ■  X  +  Y 

5.  Sum  column  7,  get  the  sum  of  K  -  32123.11 

6.  Check  your  addition  by  checking: 

does  the  sum  of  K  ■  sum  of  X  +  sum  of  Y? 

does  32123.11  -  964  +  31159.11? 

It  does,  so  we  know  the  addition  is  correct. 

7.  Get  the  mean  of  X.  Divide  the  sum  of  X  (sum  of  column  2)  by 
the  number  of  observations  (11) ,  get  mean  of  X  equals 

964/11  ■  87.6363...  a  continuing  decimal  the  6363  will  go  on 

forever 

8.  Get  the  mean  of  Y.  Divide  the  sum  of  Y  (sum  of  column  3)  by 
the  number  of  observations  (11) ,  get  mean  of  Y  equals 

31159.11/11  ■  2832.6463. . .another  continuing  decimal 
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Locomotive 


variation  in  K  is  95883396.17  -  (32123.11  tines  2920.282727)  or  2074832.90 


9.  Get  the  mean  of  K,  divide  the  sum  of  K  (sun  of  column  7)  by  11 
Mean  of  K  -  32123.11/11  -  2920.282727... 

10.  Another  check — does  the  mean  of  K  equal  the  mean  of  X  plus 
the  mean  of  Y?  Does  2920.282727  -  87.636363  +  2832.646364?  It  does  so 
the  means  have  been  calculated  correctly. 

11.  Accumulate  the  sum  of  the  X  squared  in  a  calculator,  this 
means  square  44,  square  60  and  add  to  the  square  of  44  and  so  on  until  all 
eleven  numbers  have  been  squared  and  the  sum  accumulated.  Resist  the 
temptation  to  square  individually,  write  down  and  then  sum.  This  almost 
guarantees  errors  will  creep  in.  Write  the  sum  at  the  foot  of  the  X2  column 
column  4. 

12.  Accumulate  the  sum  of  the  cross  products  of  X  and  y  in  the  calcu 
lator.  This  means  take  each  observation,  multiply  the  X  and  Y  (for  Obs.  1 
use  44  times  2208.00)  and  accumulate.  Write  the  sum  at  the  foot  of  the  XY 
column  (col.  5) . 

13.  Accumulate  the  sum  of  the  squares  of  Y  in  the  calculator  auid 
write  the  sum  at  the  foot  of  the  Y2  column  (col.  6) . 

14.  Accumulate  the  sum  of  the  squares  of  K  (col.  7)  in  the  calculator 
and  write  the  sum  at  the  foot  of  the  K2  column  (coL  8) . 

15.  Check  the  accuracy.  If  the  sum  of  K2  equals  the  sum  of  X2  plus 
the  sum  of  Y2  +  twice  the  sum  of  XY,  the  worksheet  summations  are  correct. 

Does  95883396.17  -  92336.00  +  90112509.37  +  (2  times  2839275.40) 

It  does,  so  the  major  calculations  are  correct. 

16.  The  variation  in  X  is  equal  to  the  sum  of  the  squares  of  X  minus 
(the  sum  of  X  times  the  swan  of  X) .  Remember  to  do  the  multiplication  in¬ 
side  the  parenthesis,  then  subtract. 

17.  The  covariation  in  X  and  Y  is  equal  to  the  sum  of  the  cross 
products  (col.  5)  minus  (the  mean  of  X  times  the  sum  of  Y) .  Remember, 
multiply  numbers  in  parenthesis,  then  subtract  product. 

18.  The  variation  in  Y  is  the  sum  of  the  squares  of  Y  (col.  6) 
minus  (the  sum  of  Y  times  the  mean  of  Y) . 

19.  The  variation  in  K  is  the  sum  of  the  squares  of  K  (col.  8) 
minus  (the  sum  of  K  times  the  mean  of  K) . 

20.  The  last  check  of  the  worksheet — does  the  variation  in  K 
equal  the  variation  in  X  plus  the  variation  in  Y  plus  twice  the  covariation 
in  X  and  Y? 


Does  2074832.90  -  7854.5454  +  1849769.75  +  (2  times  108604.325)? 

To  the  second  decimal  place  it  does  and  the  difference  is  due 
to  rounding.  The  calculations  on  the  work  sheet  are  correct. 

To  find  the  regression  equation:  Y  +  a  +  bX  +  e  the  calculations  are 
simple  once  the  worksheet  is  complete. 

b  *  covariation  in  X  and  Y  divided  by  variation  in  X. 
b  -  108604.325  divided  by  7854.5454 
b  -  13.8269396 

and 

a  *  mean  of  Y  minus  (b  times  mean  of  X) 
a  -  2832.646363  -  (13.8269396  times  87.6363) 
a  -  2832.646363  -  1211.7427 
a  -  1620.90366 
or  in  easier  terms 

Y  -  1620.904  +  13.827X  +  e 

In  order  to  calculate  the  numbers  we  need  for  statistical  inference  we 
need  a  table  of  variation  and  variance.  Table  13-2  shows  the  needed  table. 

The  total  variation  is  simply  the  variation  in  Y,  thats  all  there  is 
in  the  problem. 

Degrees  of  freedom  is  a  statistical  idea.  For  total  variance  it's 
always  the  number  of  observations  less  one  for  regression  problems. 

The  total  variance  is  the  variation  divided  by  degrees  of  freedom. 

The  standard  deviation  of  Y  is  the  square  root  of  the  total  variance. 

The  variation  in  Y  that  is  "explained”  by  the  variation  in  X  is  a 
statistical  concept.  We  can  avoid  a  lot  of  theory  if  we  simply  take  it  as 

Explained  variation  ■  b  times  covariation  in  X  and  Y 

In  this  case 

Explained  variation  ■  13.8269396  times  108604.325  (the  covariation 

in  X  and  Y  was  calculated  on  the  worksheet) 

Explained  variation  -  1501665.444 


In  two  variable  regression  there  is  only  1  degree  of  freedom  in  the 
explained  variation,  hence  explained  variation  and  variance  are  the  same 
(variance  is  always  variation  divided  by  degrees  of  freedom) . 

The  unexplained  variation,  that  is  the  variation  of  Y  not  explained 
by  the  variation  in  X  is  total  variation  minus  explained  variation.  The 
degrees  of  freedom  in  two  variable  regression  for  the  unexplained  is  always 
the  number  of  observations  less  two,  here  11-2  »  9. 

The  unexplained  variance  is  unexplained  variation  divided  by  degrees  of 
freedom,  here  348104.306  divided  by  9  ■  38678.256  is  the  unexplained 
variance. 

In  statistics,  the  standard  error  of  estimate  is  the  square  root  of 
the  unexplained  variance , 

V 38678. 256  -  196.66788 

This  is  our  estimate  of  the  standard  deviation  of  the  e  in  the  equation 
Y*a+bX*e,  and  the  e  is  distributed  around  that  line  Yc  “  a  +  bX. 

F  score  and  standard  error  of  b  are  calculated  on  the  table  of  variation 
and  variance ,  the  only  statistic  we  have  to  calculate  is  T  score  for  b  ■ 
b  divided  by  its  standard  error  or  t  for  b  ■  13.8269396  divided  by  2.21908  * 
6.231. 

In  making  estimates  one  uses  the  estimating  equation  and  assigns  the 
relevant  X  value.  If  one  wishes  to  estimate  the  labor  hour  overhaul  cost 
of  a  44  ton  locomotive  one  says: 

Yc  ■  1620.904  +  13.826936  (44) 

Yc  -  1620.904  +  608.385 
Yc  -  2229.289 

This  is  our  point  estimate  of  the  value  of  Y.  To  construct  a  pre¬ 
diction  interval  we  need  a  standard  error  of  prediction  at  X  ■  44.  This  is 

- — -  «*— (A,,l3r.^rin  v  ~ ofl1*' 


The  standard  error  of  estimate  is  from  the  table  of  variation  and  variance; 
the  number  of  observations  (11)  is  from  the  worksheet.  Also  from  the  work¬ 
sheet  we  get  the  (a)  variation  in  X,  and  (b)  mean  of  X. 


The  assigned  value  of  X  is  44. 


196.66788  times 


1_  (44  -  87.636363) 2 

11  7854.5454 
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-  196.66768  tints 


.090909091  + 


(-43.636363) 2 
7854.5454 


-  196.66788 


1.0909091 


1904.132176 

7854.5454 


-  196.66788  V 1.0909091  +  .2424242 


-  196.66788V  1.9333333... 

Standard  error  of  prediction  at  X  of  44 

-  196.66788  tines  1.15470054 
«  227.0925064 

A  prediction  interval  is  always:  a  point  estimate  +  some  appropriate 
t  times  a  standard  error  of  prediction.  From  a  t  table  we  find  a  90% 
confidence  level  of  t  at  9  degrees  of  freedom  is  1.8333  so  our  prediction 
interval  is: 

2229.289+1.833  C227. 0925064) 

2229.289  +  416.261 

Our  lower  bound  is:  1813.0284 

Estimate:  2229.289 

Our  upper  bound  is:  2645.550 

For  practice  you  may  wish  to  calculate  some  of  the  other  intervals  shown 
in  the  text. 
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There  are  many  good  &ets  of  tables  for  statistics;  the  one  used  here 


is: 

Standard  Math  Tables,  Chemical  Rubber  Co.,  Cleveland,  OH.  These  tables 
are  revised  frequently  and  the  current  revision  will  be  the  only  one 
available . 
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CHAPTER  14 


THE  LEARNING  CURVE 


There  Is  considerable  evidence  that  we  learn  from  experience.  A 
rather  extensive  body  of  theory,  called  learning  curve  theory,  has  grown 
around  the  idea.*  This  theory,  although  not  without  its  detractors,  has 
proven  very  useful  in  the  area  of  cost  estimating. 

The  learning  curve  gives  us  a  way  of  estimating  the  direct  costs  of 
some  items.  Hie  use  of  learning  curve  techniques  will  result  in  an 
"estimate,"  or  an  "expected"  value  that  should  be  close  "on  the  average." 

No  calculations  in  this  paper  are  meant  to  indicate  that  we  can  estimate 
costs  accurately  to  3  or  4  digits.  He  can  use  theoretical  constructs  to 
give  a  "best  estimate”  and  that  is  all. 

The  -three  basic  tools  for  use  of  learning  curve  theory  sure  log-log 
graph  paper,  a  good  calculator,  or  access  to  a  computer  terminal. 

a.  One  of  the  most  useful  tools  for  dealing  with  learning  curves 
is  graph  paper  with  coordinates  scaled  so  numbers  can  be  plotted  as 
logarithms .  Both  x  and  Y  axes  are  so  scaled.  This  type  of  graph  paper 
is  called  log-log  paper.  It  has  the  characteristic  that  equal  ratio 
changes  plot  as  a  straight  line.  Say  the  three  points  (XjY^) ,  (2xlr  .8Y1) , 
'W  (4Xj_,  .64Y]_)  (each  X  is  twice  the  previous  X  and  each  Y  is  .8  times  the 

previous  Y)  these  three  points  will  plot  in  a  straight  line  on  log-log 
paper.  (To  illustrate  this  let  ■  1  and  Y^  *  1000,  plot  the  points 
(1,1000),  (2,800),  (4,640),  and  see  if  they  fall  on  a  line.) 

Figure  14-1  is  a  sheet  of  log-log  paper.  Note  the  scales  are 
peculiarly  spaced.  There  is  no  zero  because  zero  does  not  have  a  logarithm, 
the  distance  from  four  to  five  is  much  less  than  the  distance  from  one 
to  two,  in  fact,  the  distance  from  five  to  ten  is  the  same  as  the  distance 
from  one  to  two.  The  change  in  distance  represents  a  change  in  ratio, 
from  four  to  five  is  from  X  -  4  to  1.25X  -  5.  She  distance  from  one  to 
two  is  from  X  «  1  to  2X  ■  2,  this  latter  is  the  same  as  the  distance  from 
X  -  5  to  2X  ■  10,  the  same  ratio  as  1  to  2. 

It  is  important  to  remember  that  equal  distances  on  the  paper 
represent  equal  changes  in  proportion — if  one  goes  from  X  ■  1  to  10X  *  10 
the  result  is  a  tenfold  increase,  so  the  second  time  this  distance  is 
covered  one  is  going  from  X  ■  10  to  10X  ■  100,  if  there  was  a  third  cycle 
it  would  be  from  X  *  100  to  10X  ■  1000.  This  scaling  requires  care  in 
reading  or  in  plotting. 


♦NOTE:  Yelle,  Louis  E.,  "The  Learning  Curve  Historical  Review  and  Com¬ 
prehensive  Survey,"  Decision  Sciences,  Vol.  10,  No.  2  (Apr  79)  contains 
an  extensive  bibliography. 
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The  divisions  of  logarithmic  scales  also  can  cause  trouble.  Mote 
the  space  from  1  to  3  and  10  to  30  have  20  minor  divisions  per  major 
division.  From  3  (or  30)  to  6  or  (60)  there  are  10  minor  divisions  per 
major  division.  From  6  to  10  (or  60  to  100)  there  are  five  minor  divisions 
per  major  division. 

Also,  note  that  things  tend  to  crowd  up,  measuring  from  1  two  minor 
divisions  to  1.1  is  about  4.5  milimeters  on  this  paper.  Starting  at  five 
and  going  right  about  4.5  mm  gets  you  to  5.5  or  5  times  1.1.  If  you  start 
at  9  and  go  to  the  right  about  4.5  mm  you  get  to  9  times  1.1  or  9.9,  you 
have  nearly  gone  out  of  the  9  range.  This  explains  the  crowded  conditions 
in  the  range  nine  to  ten  or  90  to  100. 

This  kind  of  paper  can  be  secured  from  any  university  bookstore. 

b.  The  second  basic  tool  is  a  good  calculator.  One  that  has  logarithms, 
and  can  handle  powers  and  fractional  exponents.  While  a  good  analyst  can 
complete  all  the  estimates  using  graph  paper,  a  calculator  allows  more 
accuracy  and  provides  a  check  on  the  graphical  results.  About  the  only  thing 
you  cannot  easily  do  with  a  calculator  is  calculate  the  slope  of  the  curve 
from  actual  data. 

c.  The  third  basic  tool  is  access  to  a  computer  terminal.  There  is 
generally  a  learning  curve  program  available  which,  given  data,  will  do 
all  the  computations.  This  does  not  relieve  you  of  the  responsibility  for 
graphing  the  data!  A  graph  can  highlight  changes  in  the  production  process 
that  would  never  be  flagged  otherwise.  Once  again,  access  to  a  terminal 
or  even  a  fancy  calculator  are  not  required,  but  they  cam  help. 

Formulations ; 

The  learning  curve  comes  in  two  variants 

Unit  Curve  (U.C.)  Yx  -  AX*5 

where:  Yx  is  the  cost  of  the  Xth  unit,  A  is  a  constant  (the  cost  of  unit 
one) ,  X  is  the  sequential  unit  number  (usually  limited  to  production  units) 
and  b  is  a  constant,  a  negative  decimal  fraction,  and 

Cumulative  Average  (C.A.)  Yx»  AX15 

where:  Yx  is  the  average  cost  for  units  1  to  X  inclusive 

A  is  the  cost  of  unit  one 
X  is  the  sequential  unit  number 
b  is  a  negative  decimal  fraction. 


What  is  Being  Estimated? 

If  Yx  or  Yx  is  in  dollars  it  must  be  in  constant  dollars,  but  usually 
cost  is  expressed  in  real  terms,  for  example: 

Labor  hours  for  manufacturing  labor 
Tooling  hours  for  direct  tooling 
Engineering  hours 
Pounds  of  material 

Slope : 

One  often  hears  about  "slopes"  of  learning  curves,  the  slope  is  the 
percent  the  Y  is  of  the  former  Y  as  X  doubles. 

Slope  *  UML  *100% 

Y(X)b 

If  the  "unit  curve"  Yx  *  AX*5  is  the  learning  curve  and  has  a  80% 
slope,  the  points  (X,Y) ,  (2X,  .8Y)  are  on  the  curve.  For  instance,  if 
A  ■  1000  the  following  table  describes  an  80%  curve. 

Unit  Cost  in  Labor  Hours 

1  1000 
2  800 

4  640 

8  512 

16  409.6 

The  value  of  intermediate  units  can  be  determined  by  the  formula. 


Yx-  AX 


-.321928 


thus,  unit  3  would  cost 


„  321928 

Y3  *  1000(3) 

-  1000 (.7021037) 

Y^  •  702.1  or  unit  3  costs  about  702.1  labor  hours 


and  unit  5  would  cost 


-.321928 


Y5  -  1000(5) 

-  1000 (.59563734) 

Y^  *  595.6  approximately 
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Percent  slopes  are  applied  to  cost  as  X  doubles, 
70%  curve  looks  like  this 


If  A  «  1000,  a 


Cost  In  Labor  Hours 


1000 

700  (70%  of  1000) 
490  (70%  of  700) 
353  (70%  of  490) 
240.1  (70%  of  343) 


For  this  70%  curve  the  formula  is 


-.514573 


Yx  -  1000X 


Notice  this  formula  differs  from  the  previous  one  only  in  the  exponent- 
that  difference  makes  it  a  70%  curve  not  an  80%  curve. 

As  with  an  80%  curve  the  value  of  units  3  and  5  could  be  approximated 
with  the  following  results: 


Y3  »  568.2  approximately 


Ys  -  436.8 


One  cam  use  this  type  of  calculation  to  estimate  direct  costs  of  a 
particular  unit,  the  total  cost  of  a  specified  number  of  units  or  the 
average  cost  of  some  specified  number  of  sequential  units. 


ale  of  Unit  Curve: 


A  single  example  of  results  from  an  85%  curve 


Unit  Curve  Y  -  1500X 

A 


-.234465 


i 
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Table  14-1 


Unit 

Cost  of 

Cost  of 

Average  Cost 

No. 

Unit 

X  Units 

For  X  Units 

(X) 

(Yx) 

(TCX) 

(TC*/X) 

1 

1500.0 

1500.0 

1500.0 

2 

1275.0 

2775.0 

1387.5 

3 

1159.4 

3934.4 

1311.5 

4 

1083.8 

5018.2 

1254.5 

5 

1028.5 

6046.7 

1209.3 

6 

985.5 

7032.2 

1172.0 

7 

950.5 

7982.6 

1140.4 

8 

921.2 

8903.8 

1113.0 

9 

896.1 

9799.9 

1088.9 

10 

874.2 

10674.2 

1067.4 

NOTE:  Detail  may  not  add  because  of  rounding. 


This  is  a  mathematical  construct  called  the  unit  curve  or  Boeing 
Formulation  of  the  learning  curve.  Actual  data  would  be  subject  to  many 
random  influences  and  would  be  somewhat  different. 

If  you  need  practice  using  log-log  paper,  plot  the  data  above.  (At 
least  the  Cost  of  Unit  X  and  Average  Cost  for  X  Unit.)  Figure  14-2  shows 
the  results  of  this  plot.  Note  that  the  unit  line,  the  one  that  shows  the 
cost  of  unitx,  is  straight  on  log- log  paper,  but  the  plot  of  the  averages 
is  not  a  straight  line. 


Example  of  Cumulative  Average  Curve: 

There  is  a  different  form  of  the  learning  curve.  The  Cumulative 
Average  Curve. 


,x-A*b 

The  mathematical  function  is  identical  but  it  has  a  different  meaning. 
This  curve  is  increasing  in  popularity .  in  1970,  95%  of  the  curves  studied 
by  Defense  Contract  Audit  Agency  were  unit  type.  Today  many  more  Cumula¬ 
tive  Average  Curves  would  be  found  but  no  recent  study  is  available. 

Using  the  values  of  A  and  b  as  before  A  -  1500,  b  “  -.234465,  an  85% 
cumulative  average  curve  would  give  results  like  Table  14-2. 


X  ■  U  I  ■  3  l  W  "TT  T  r  * 


85*  Cumulative  Average  Curve  Y  -  1500X-*234465 

A 


Table  14-2 


Unit 

Average  Cost 

Total  Cost 

Cost  of 

No. 

For  X  Units 

of  X  Units 

Unit  X 

(X) 

<V 

(TV*V 

(Y^-IC 

1 

1500.0 

1500.0 

1500.0 

2 

1275.0 

2550.0 

1050.0 

3 

1159.4 

3478.1 

928.1 

4 

1083.8 

4335.0 

856.9 

5 

1028.5 

5142.5 

807.5 

6 

985.5 

5912.8 

770.3 

7 

950.5 

6653.4 

740.6 

8 

921.2 

7369.5 

716.1 

9 

896.1 

8064.9 

695.4 

10 

874.2 

8742.3 

677.4 

NOTE:  The  numbers  In  column  two  of  Table  14-2  are  identical  to  column  one 
of  Table  14-1  but  this  time  instead  of  being  the  cost  of  unit  X  they  are 
the  average  cost  for  units  1  to  X  inclusive. 

The  average  is  cost  per  unit)  therefore,  if  one  multiplies  the  average 
by  the  number  of  units,  the  product  will  represent  the  total  cost  of  that 
many  units,  or  as  the  table  says,  the  total  cost  of  x  units  (units  1 
through  X  inclusive) . 


TCX  "  XYX  *  XA*b  *  AJ(b+1 

The  last  column  is  the  cost  of  unit  X  which  is,  of  course,  the  differ¬ 
ence  between  the  total  cost  of  X  units  and  the  total  cost  of  (X-l)  units. 
Using  the  X  ■  5  row  as  an  example. 

Y5  -  1500(5"* 2 34465 )  -  1028.5 

The  cost  of  unit  five  is: 

Y5  -  (5) (1028.5)  -  (4) (1083.75)* 

-  5142.5  -  4335.0  -  807.5 

•Rounding  makes  this  operation  appear  incorrect. 

This  data  is  plotted  at  Figure  14-3.  Notice  the  cumulative  average 
curve  is  a  straight  line  while  the  unit  curve  is  not  a  straight  line. 
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Choosing  *  Formulation: 


The  coice  of  which  fora  of  the  learning  curve  to  use,  unit  or  cumulative 
average,  is  not  as  difficult  as  it  would  appear.  Perhaps  the  analyst  knows 
the  contractor  who  furnished  the  data  and  either  the  analyst  or  someone 
familiar  with  the  company  knows  which  form  that  company  uses.  If  this 
knowledge  is  lacking,  the  analyst  simply  has  not  done  the  preparation  needed 
for  a  cost  estimate. 

Be  Sure  LC  Theory  Applies: 

there  is  a  strong  probability  the  learning  curve  is  misapplied  on 
occasion.  There  are  instances  where  the  learning  curve  theory  does  not 
apply.  For  example: 

a.  If  the  Air  Force  is  buying  commercial  items  from  a  contractor 
who  is  also  making  the  item  for  sale  to  the  public,  the  learning  curve 
makes  no  sense.  Even  if  one  knew  the  company  had  a  learning  curve,  how 
does  the  AF  know  which  number  items  it  bought? 

b.  Many  analysts  are  convinced  there  is  no  learning  curve  for  mass- 
produced  items  (those  whose  production  is  at  or  above  60,000  units 
annually) . 

c.  Some  accounting  systems  would  not  reveal  a  learning  curve  even 
if  it  existed.  The  comptroller  of  one  of  the  "big-three"  automobile 
companies  testified  under  oath  his  company  had  no  way  to  detect,  the  cost 
of  a  sedan  as  it  came  off  the  assembly  line.  The  best  the  company  could 
do  was  know  how  much  it  cost  to  run  that  line  for  a  period  of  time  and 
by  knowing  how  many  cars  were  made  during  that  period  they  could  furnish 
the  average  cost  of  the  cars  made  during  that  period. 

c.  One  author  claims  learning  curves  are  due  to  undetected  nonrecurring 
costs  in  lot  one.  An  example  will  be  given  later. 

Given  a  learning  curve  problem,  what  should  an  analyst  do?  As  a  first 
step  he  should  plot  the  data  that  purports  to  be  a  learning  curve,  plot 
both  the  unit  values  and  the  cumulative  average  values. 

Plotting  Lot  Data. 

The  plotting  done  above  is  easy,  the  values  for  each  unit  were  given, 
this  is  rarely  the  case.  One  usually  has  data  on  a  lot  (direct  cost  data) 
and  it  is  necessary  to  edit  the  data  in  order  to  plot  it.  One  needs  a 
point,  a  specific  X  and  a  specific  Y  value  to  plot.  The  data,  however, 
looks  like  that  in  Table  14-3.  The  data  in  Table  14-3  is  hypothetical)  it 
follows  a  perfect  83*  learning  curve,  unit  type,  with  a  unit  one  value  of 
2500. 
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If  this  data  is  to  be  plotted  it  is  necessary  to  find  sons  way  to 
represent  each  lot  with  one  (or  more)  points  on  a  sheet  of  log- log  paper. 

Die  solution  is  to  find  a  single  point  to  represent  the  data  for  the  unit 
curve  and  another  point  to  represent  the  same  lot  for  the  cumulative 
average  curve. 

To  accomplish  this  job  in  an  orderly  way  it  iB  advisable  to  construct 
a  work-sheet,  a  guide  for  the  process.  worksheets  are,  of  course,  personal 
and  each  analyst  feels  free  to  make  their  own,  but  one  such  as  Figure  14-4 

is  recommended .  if  you  choose  this  form  leave  room  on  the  right  for  three 

more  columns,  we  are  going  to  get  the  points  for  the  unit  curve  first  then 
later  the  points  for  the  cumulative  average. 

Die  start  of  the  editing  process  is  shown  in  Figure  14-4a.  The  informa¬ 
tion  available  has  been  entered  and  a  zero  placed  in  the  cumulative  units 
column. 

The  columns  are: 

Lot  Nunfeer:  Simply  a  way  of  identifying  the  lot. 

Lot  Size:  The  number  of  units  in  the  lot. 

Lot  Value:  The  total  cost  of  the  lot  in  units  specified, 

in  this  case  labor  hours. 

Cumulative  Units:  The  total  number  of  units  that  were,  or  will  be, 

produced  when  this  lot  is  completed.  This  column 
represents  a  summation  of  the  lot  size  through 
the  lot  being  entered. 

Lot  Midpoint:  A  convenient  method  of  identifying  the  middle  of 

the  lot.  Fractional  units  are  permissable  in 
this  column. 


rigor*  14-4.  Partial  Worksheet  for  Estimating  Plot  Points 


rigor*  14-4a.  Layout  Start  of  Edit 


Lot 

no. 

Lot 

Sis* 

Lot 

Value 

(Labor 

Hours) 

Casuist! v* 
Units 

Lot 

Mid¬ 

point 

Lot  Plot  Points 
unit  Curve 

!!*  !!i 

0 

1 

10 

16967 

2 

15 

17424 

3 

20 

19242 

4 

20 

17017 

5 

40 

30341 

Figur*  14-4b.  Worksheet  coopleted  for  unit  Curv* 


Lot 

NO. 

Lot 

sis* 

Lot 

valu* 

(Labor 

Boors) 

Cumulative 

Units 

Lot 

Mid- 

Point 

Lot  Plot  Points 

Unit  Curve 

PP  PP  * 

X  Y 

1 

10 

16967 

0 

10 

3.33 

3.33 

1697 

2 

15 

17424 

25 

7.5 

17.5 

1162 

3 

20 

19242 

45 

10 

35. 

962 

4 

20 

17017 

65 

10 

55. 

851 

5 

40 

30341 

105 

20 

85.0 

759 

•Three  digit  accuracy  is  enough  for  plotting 


PPX  is  the  X  plot  point  for  the  lot-- the  midpoint  of  the  lot  plus 
cumulative  units  through  the  previous  lot,  fractions  permissible  in  this 
column. 

PPYS  The  Y  plot  point,  the  total  value  of  the  lot  divided  by  number 
of  units  in  lot  (lot  value)?  (lot  size),  the  average  value  for  the  lot. 

A  zero  was  entered  in  the  cumulative  units  because  there  has  been  no 
previous  production,  if  this  work  sheet  were  a  continuation,  total  previous 
production  would  be  entered  here— -an  uncommon  occurrence. 

I  prefer  to  fill  in  cumulative  units  for  each  lot  as  the  last  entry 
for  the  lot. 

When  the  work  sheet  is  prepared,  entries  calculated  are  usually 
entered  only  to  accuracy  one  can  plot. 

Step  1  Find  lot  mid-point  for  the  first  lot.  If  the  lot 
size  is  less  than  ten,  the  lot  midpoint  is  the  lot 
size  divided  by  2.  This  is  not  true  for  the  example. 

If  the  lot  size  is  10  or  more,  as  here,  the  lot  mid¬ 
point  is  the  lot  size  ?  3,  10  i  3  *  3.33,  enter  3*.  33 
in  lot  midpoint  column. 

Step  2  Find  PPX  for  first  lot,  PPX  •  lot  midpoint,  enter  3.33 
in  PPX  for  lot  1. 

Step  3  Find  ppy  for  lot,  PPy  ■  lot  value  ~  lot  size  16967/10  - 
1697  (few  people  can  plot  even  this  accurately). 

Step  4  Fill  in  cumulative  units  column,  in  this  case  no  previous 
production  so  at  conclusion  of  lot  one,  ten  units  will 
have  been  produced,  enter  a  10. 

The  following  steps  will  be  repeated  as  often  as  necessary  to  complete  work 
sheet: 

Step  5  Get  lot  midpoint  for  lot. 

Lot  midpoint  ■  lot  size/2,  enter  on  worksheet 

-  .  - - -  step  6  Get  PPX  for  lot .  — 

PPX  -  lot  midpoint  plus  all  previous  productions. 

■  lot  midpoint  plus  cumulative  units  through  previous  lot. 

Step  7  Get  PPY  for  lot. 

PPy  -  lot  value/lot  size 

Step  8  Get  cumulative  units. 

Cumulative  units  ■  cumulative  units  through  previous  lot 
plus  lot  size  for  this  lot. 

If  not  the  last  lot  go  back  to  Step  5  and  repeat. 
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Each  step  should  be  checked  with  results  In  Figure  14-4b. 


This  part  of  the  plot  should  always  be  carried  out  even  if  one 
suspects  the  cumulative  average  curve  is  the  true  formulation. 

Figure  14-5  shorn  how  this  data  plots  on  log- log  paper. 

This  plot  shows  unit  one  costs  of  2500  on  an  83%  slope. 

The  2500  comes  from  reading  the  curve  at  x  ■  1.  The  83%  or  slope  of 
the  curve  is  measured  as  follows: 

Select  an  X  and  2X-,  in  this  case  X  ■  20,  2X  -  40  was  chosen. 

Read  the  curve  at  the  X  values,  the  points  are  (20,  1115),  (40,  925). 
Divide  Y2x  by  Yx  and  change  to  percent,  925/1115  =  .83  *  83%. 

If  in  doubt,  select  severed  different  X,  2X  pairs,  and  average  the 
results . 

Note  that  four  of  the  points  are  perfect  (or  nearly  perfect)  fit  but 
lot  one  is  too  far  to  the  left.  This  is  characteristic  of  first  lots,  they 
are  frequently  ignored  while  fitting,  then  a  check  is  made. 

The  b  of  a  learning  curve  can  be  calculated 

.  log  "slope" 
log  2 

where  "slope"  means  the  decimal  fraction  not  the  percent.  In  this  case 


,  log  (.83)  _  -.08092191 

D  *  log  (.2)  "  +.30103000 


We  know; 


1696.7  »  2500X 


-.26881676 


so 


.678680 


-.26881676 

A 


-.26881676 


and  log  (.678680)  -  -.26881676  (log  (X) ) 

so  -.16833495/-. 26881676  -  log  X 

■  .62620704  ■  log  X 

X  -  4.229 


This  means  our  rule  of  thumb  X  ■  3.33  is  about  .9  units  too  low.  If  we 
moved  the  first  lot  plot  point  to  (4.229,  1697)  it  would  be  on  the  line  (or 
extremely  close)  so  we  can  feel  our  curve  is  "right." 


UNIT  NO 


Most  practioners  of  learning  curve  analysis  do  not  check  the  first  lot 
point,  if  its  to  the  left  of  the  line  they  feel  satisfied. 

Now  let  us  set  up  our  worksheet  to  plot  the  cumulative  average  curve. 

Our  worksheet  needs  three  more  columns,  a  cumulative  total  cost  column, 
and  PPxE'  f°r  plQt  points.  Enter  a  zero  above  the  row  for  the 

first  lot  in  the  cumulative  total  cost. 

Hie  cumulative  total  cost  column  is  the  sum  of  lot  values  through  that 
lot.  Figure  14-6a  shows  these  added  columns. 

Hie  plot  point  X  for  each  lot  is  the  number  of  units  that  will  have 
been  made  at  the  end  of  the  lot,  this  is  simply  the  cumulative  units 
through  that  lot. 

The  plot  point  Y  is  the  cumulative  average  for  all  units  made  by  end 
of  lot, 

»  PP—  =  cumulative  total  cost  -  cumulative  units 

*  Y 

Having  gone  through  the  unit  curve  plot  points,  the  steps  for  calculating 
the  cumulative  average  plot  points  are: 

Step  1  Add  the  lot  value  to  the  cumulative  total  cost  column 
Ilf  its  the  first  lot,  simply  enter  lot  value  in 
cumulative  total  cost  column) . 

Step  2  Enter  cumulative  units  through  the  lot  in  the  PP^  column 
(The  X  plot  point  is  number  of  units  at  the  end  of 
the  lot) . 

Step  3  Divide  cumulative  total  cost  by  cumulative  units  and 

enter  quotient  in  the  PPy  column.  (As  the  name  indicates 
the  cumulative  average  is  average  cost  for  units  1  through 
X.) 

Step  4  If  xts  not  the  last  lot,  go  back  to  step  one  and  repeat 
the  sequence. 

Hie  completed  worksheet  is  shown  in  Figure  14-6b. 

This  data  is  plotted  in  Figure  14-5.  The  plot  behaves  as  it  should, 
the  result  in  nonlinear  (on  log-log  paper)  and  the  cumulative  average 
curve  would  be  above  the  unit  curve. 

The  data  above  is  all  part  of  learning  curve  and  plots  as  it  should. 

Now,  consider  the  data  in  Table  14-4. 
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Table  14-4 


Lot 

Lot 

Cumulative 

Cumulative 

Average 

NO. 

Size 

Units 

(labor  hours) 

1 

10 

10 

3857.8 

2 

15 

25 

2057.8 

3 

25 

50 

1457.8 

4 

20 

70 

1286.4 

5 

40 

110 

1130.6 

If  one  uses  the  cumulative  average  theory  and  plots  this  data  and  draws 
the  "best"  line  one  gets  a  cumulative  average  curve,  unit  one  cost  about 
11,000,  slope  about  70%.  Since  the  points  do  not  fall  on  the  line,  there 
is  some  room  for  judgment. 

Example  of  a  Problem  with  Nonrecurring  Costs; 

Figure  14-7  shows  this  plot  and  learning  curve.  The  plot  looks  reason¬ 
able,  and  one  could  get  the  impression  this  curve  described  the  output. 

This  is  a  false  impression.  The  whole  picture  is  caused  by  a  nonrecurring 
cost  in  the  first  lot.  Let  us  reconstruct  the  data  so  we  cam  plot  the  unit 
curve. 


Lot  Is  There  were  10  units  in  this  lot,  they  average  3857.8 
hours,  the  cost  of  the  lot  is  10  (3857.8)  -  38578 


of  course,  the  average  cost  is  3857.8. 


|  Lot  2:  There  were  15  units  in  this  lot.  At  25  units  the  cumulative 

'  average  is  2057.8.  The  cost  of  this  lot  is  then: 


Cumulative  total  for  25  units  ■  25(2057.8)  *  51445 

Minus  cumulative  total  for  10  units  38578 

Cost  of  lot  12867 

|  Average  cost  for  lot  857.8 


Lot  3:  There  were  25  units  in  this  lot.  At  50  units  the  cumulative 
average  is  1457.8.  The  cost  of  this  lot  is  then: 


Cumulative  total  for  50  units  «  50(1457.8)  -  72890 

I  Minus  cumulative  total  for  25  units  51445 

Cost  of  this  lot  of  25  units  21445 

Average  cost  for  this  lot  857.8 


Lot  4:  There  were  20  units  in  this  lot,  at  the  end  of  the  lot  70 
units  had  been  made.  Cumulative  average  is  1286.4. 

I  Cost  of  this  lot  is: 


8 


Cumulative  Average  Plot  and  Curve 
for  Data  in  Table  14-4 


9  10  2  3  456789  10 


Cumulative  total  for  70  units  *  70(1286.4)  ■  90048 

Minus  cumulative  total  for  SO  units  72890 

Cost  of  this  lot  17158 

Average  cost  for  this  lot  857.9 


Lot  5:  There  were  40  units  in  this  lot,  at  the  end  of  the  lot  110 
units  had  been  made  at  an  average  cost  of  1130.6.  Cost  of 
this  lot  is: 

Cumulative  total  cost  for  110  units  -  110(1130.6)  «  124366 
Minus  cumulative  total  cost  for  70  units  90048 

Cost  of  this  lot  34318 

Average  cost  for  this  lot  858  0 

it  should  be  obvious  the  average  cost  or  cost  per  unit  for  all  units 
after  the  first  lot  is  approximately  858  labor  hours. 

The  correct  analysis  of  this  data  is  shown  in  Table  14-5. 


Table  14-5 


I 


Lot 

Lot 

Lot 

Non- 

Recurring 

Average  Recurring 

No. 

Size 

Value 

Recurring 

Cost 

Cost  for  Lot 

1 

10 

38578 

30000 

8578 

857.8 

2 

15 

12867 

12867 

857.8 

3 

25 

21445 

21445 

857.8 

4 

20 

17158 

17158 

857.9 

5 

40 

34318 

34318 

858.0 

Total  cost  of  110  units  as  recorded: 


Nonrecurring  Cost 
Recurring  Cost 


30000 

94366 


Total  Cost 


124366 


If  one  had  to  predict  a  follow-on  lot  the  best  prediction  would  be 
858  hours  per  unit. 

Note  carefully:  If  one  used  only  the  cumulative  average  curve  one 
would  erroneously  decide  learning  curves  did  apply. 

Another  case  that  could  be  misinterpreted  and  learning  curve  analysis 
erroneously  applied  is  the  case  where  costs  axe  essentially  fixed  by  pro¬ 
duction  changes  during  periods  of  time.  Once  again  preparation  of  a  work¬ 
sheet  such  as  Figure  14-4  will  detect  that  lot  costs  are  the  same,  inde¬ 
pendent  of  lot  size. 
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The  propar  preparation  of  worksheets  will  help  greatly  in  determining 
whether  or  not  the  learning  curve  does  apply. 

Example  of  Pse  of  Unit  Curve: 

The  major  use  of  learning  curve  analysis  is  to  estimate  costs.  Assume 
we  have  to  estimate  the  cost  of  100  units  of  an  airframe  and  we  find  the 
following i 

-  Manufacturing  labor  will  have  an  85%  slope  type  unit  curve 
with  a  first  unit  cost  of  230,000  hours. 

-  Direct  materials  will  have  a  95%  slope,  a  unit  type  curve, 
first  unit  cost  $875,000  197X  dollars. 

-  Engineering  labor  will  have  a  75%  slope,  a  unit  curve,  unit 
one  cost  964,000  hours. 

-  Tooling  will  have  a  70%  unit  type  curve,  unit  one  cost, 

1,230,000  hours. 

-  Quantity  control  labor  will  be  13%  of  manufacturing  labor  hours. 

\ 

This  is  estimated  or  known: 

EXAMPLE  14-1 


Direct  Cost 

Slope  of  Unit  Curve 

Cost  of  Unit  One  (A) 

Manufacturing  Labor 

85% 

230,000  hrs 

Materials 

95% 

875,000  197X  dollars 

Engineering  Labor 

75% 

964,000  hrs 

Tooling 

70% 

1,230,000  hrs 

In  each  case  the  lot  plot  point  will  be  33.3.  Since  we  have  one  lot 
of  100  units  the  lot  midpoint  will  be  100/3,  and  with  a  first  lot  the  lot 
midpoint  and  plot  point  are  the  same.  For  the  lot: 

Total  Cost  -  100(A) (33.3) b 
and  b  -  log  "slope "/log  2 


One  way  to  estimate  the  costs  would  be  to  use  a  calculator.  In  this  case 
an  HP-65. 


To  estimate  manufacturing  cost: 


On  Keyboard  Enter 

Press 

Read 

Nothing 

DSP.  8 

0 .00000000 

33.3 

ENTER 

33.3 

.85 

Log 

-.07058107 

2 

Log 

.30103000 

Nothing 

i. 

• 

-0.23446525 

Nothing 

.43958182 

230,000 

X 

101103.8195 

100 

X 

10110382 

hence  the  estimate  of  direct  labor  hours  in  100  units  this  airframe  is 
10,110,382  at  $10.00  an  hour  (19 7X  rate)  direct  labor  cost  would  be 
$101.102820  million  or  101.1  million  197X  dollars. 

For  direct  material  costs  we  will  need: 

100(875000) C33.3b) 
and  b  »  log  .95/log  2  -  -0.0740058 

the  estimate  (made  with,  the  same  HP65  calculator)  is: 

100 (875000) (33. 3”‘°^400058) 

-  100(875000)1.77150411)  -67.5  million 
197X  dollars 

Engineering  labor  we  need: 

100  (964000) (33. 3b) 
and  b  ■  log  .75/log  2  ■  -.4150375 
the  estimated  cost  is: 

22501195  engineering  hours. 

The  tooling  hours  are: 

100(1230000) (33. 3b) 
and  b  -  log  .7/log  2  -  -.51457317 
the  estimated  cost  is: 


20253317  hours 


The  final  estimate  is  quality  control  labor  *13%  mfg  labor  •  13%  or 
10,110,332  or  1,314,350  hours. 


Another  way  to  make  the  estimates  is  to  use  a  computer  program.  In 
this  case  the  LEARN*  program  on  the  General  Electric  system  was  used.  Results 

ares 


GE  Estimate 


HP  65  Estimate 
Above 


$%  Difference 


GE 

HP65 


*100 


Labor  (hrs)  10,063,390  10,110,382  +0.46 
Material  ($197X)  67,013,090  67,506,610  +0.74 
Engineering  (hrs)  23,3p8.156  22,501,195  -3.46 
Tooling  (hrs)  21,883,572  20,253,317  -7.44 


Certainly,  the  entries  made  with  the  HP65  calculators  are  workable 
estimates . 

The  third  and  simplest  way  to  make  the  estimate  is  to  use  log- log  paper. 
For  exanqple,  to  estimate  labor  hours  you  should  calculate  the  values  on  the 
85%  learning  curve. 

(Carry  as  many  digits  as  feasible) 


Unit 

Value 

Source 

1 

230,000 

given 

2 

195,000 

85%  of 

unit 

1  value 

4 

166,200 

85%  of 

unit 

2  value 

8 

141,300 

85%  of 

unit 

4  value 

16 

120,100 

85%  of 

unit 

8  value 

32 

102,100 

85%  of 

unit 

16  value 

64 

86,700 

85%  of 

unit 

32  value 

Then  plot  and  connect  them  as  shown  in  Figure  14-8.  Read  the  value  at 
X  »  33.3.  It  is  (very  close  to)  101,000.  This  is  the  average  value  for 
the  lot  so  100  units  time  the  average  value— 101,000  times  100  *  10,100,000 
which  value  is  close  to  our  other  estimates  and  was  very  easy  to  get,  a 
piece  of  graph  paper,  plus  straight  edge  and  pencil  were  all  the  tools  needed. 

To  Estimate  Costs  of  a  Follow-on  Lot  (UC) t 
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EXAMPLE  14-2: 


Direct  Cost 


Slope  of  Unit  Curve  Cost  of  Unit  One  (A) 


Manufacturing  Labor 

80% 

250,000  hrs  ' 

Materials 

93% 

850,000  $198X 

Engineering  Labor 

77% 

950,000  hrs 

Tooling 

73% 

120,000  hrs 

The  lot  contains  25  units,  the  midpoint  is  25  t  2  *  12.5.  Previous 
production  is  (or  will  be)  75  units  so  the  plot  point  is  75  +  12.5  or  87.5. 


Each  estimate  will  be  from  an  equation. 


Cost  of  Lot  ■  (Number  of  units  in  lot)  (Cost  of  unit  one) 
(Plot  point  raised  to  b) 

TC2  -  25(A) (87. 5b) 

The  appropriate  b's  are 


for  80%:log(.80)  t  log(2) 
90%:log( .93)  t  log(2) 
77%: log ( .77)  f  log(2) 
73%:log(.73)  f  log(2) 


-.0321928 

-0.104697 

-0.377070 

-0.454032 


The  estimated  costs  of  lot  are  calculated: 


Manufacturing  labor 

Cost  -  (25) (250,000) (87. 5'0*321928) 

»  (25)  (250,000) (.626149) 

*  1,481,500  hours  approximately 

Materials  »  (25) (950,000) (87. 5-0*3’7070) 

-  (25) (950,000) (.185237) 

■  4,399,400  hours  approximately  - 

Tooling  hours  -  (25) (120,000) (87. 5"0,454032) 
-  (25) (120,000) (.131300) 

■  393,900  hours  approximately 


These  estimates  for  lots  after  the  first  lot  are  more  nearly  those  of 
high  sophistication  computer  programs.  Compare  the  outputs:  ' 


GE  Computer  Estimate 


Estimate 

(Rounded  to  same  no.  of 
digits  as  desk  calculator) 

Desk  Calculator 
Estimate 

PCT 

Diff 

Mfg  labor 

1,480,900 

1,481,500 

0.04% 

Materials 

13,302,900 

13,305,700 

0.02% 

Engineering  labor 

4,397,700 

4,399,400 

0.04% 

Tooling  hours 

393,800 

393,900 

0.03% 

Figure  14-9  shows  the  graphs  of  the  manufacturing  labor  and  material  curves. 
Unit  87.5  represents  the  average  cost  for  the  lot,  the  cost  of  the  lot 
is  the  average  multiplied  by  the  number  of  units  in  the  lot. 

For  manufacturing  labor  Z  read: 

Y_  -  59,200  times  25  -  1,480,000 
L 

For  direct  material: 

7.  -  528,000  times  25  -  13,200,000 

L 

Figure  14-10  shows  the  engineering  labor  and  tooling  hours  curves  from  which 
For  engineering  labor 

Y_  *  175,000  hrs  times  25  units  »  4,386,000 

L 

For  tooling  hours 

Yt  ■  15,700  times  25  ■  393,000 

li 

A  difficulty  with  graphs  is  that  you  cannot  read  them  to  more  than  three 
digits  so  these  estimates  look  worse  than  those  calculated.  However,  the 
precision  from  the  calculation  is  more  apparent  than  real— it's  a  function 
of  the  calculator  word  length.  There  are  only  three  digits  in  the  cost  of 
unit  one  so  not  more  than  three  digits  are  significant . 

Cumulative  Average  (CA)  Formulation: 

If  the  cumulative  average  curve  is  the  form  of  the  learning  curve  to 
be  used,  then  the  estimation  is  different.  With  the  unit  curve  we  vised  a 
number  near  the  middle  of  the  lot  because  the  average  was  in  the  middle  of 
the  lot. 

When  the  cumulative  average  formulation  describes  the  cost  pattern  the 
average  for  X  units  is  plotted  at  X.  With  an  80%  curve  one  has: 

Yxr  AX-322928 

If  unit  one  cost  1500  hours  then: 

Y5-  1500(5  -**^*a)  .  1500  (0.595637) 

-  893 

893  is  a  good  estimate  of  the  average  cost.  If  further  calculations 
are  to  be  attempted  we  should  carry  more  digits,  say  use  893.456  as  the 
average  and  then  the  lot  will  cost  5(893.456)  ■  4467  or  better  4470. 


Plot  of  Unit  Learning  Curve  for 
Manufacturing  Labor  and  Direct 
Materials  for  Example  14-2 
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Plot  of  Unit  Learning  Curve  for 
Engineering  Labor  and  Tooling 
Hours  for  Example  14-2 
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Table  14-6  describes  an  80%  cumulative  average  curve  with  a  unit  one 
cost  of  2500  hours. 


Table  14-6 


Unit 

(X) 

Average  Cost 
of  X  units 

Y  -  2500Xb 

A 

Total  Cost 
of  x  units 

TC  =  X2500Xb 

A 

Cost  of  X 

Unit 

TCx  -  TC(X-1) 

1 

2500 

2500 

2500 

2 

2000 

4000 

1500 

3 

1755 

5266 

1266 

4 

1600 

6400 

1134 

5 

1489 

7446 

1045 

6 

1404 

8425 

980 

7 

1336 

9  354 

928 

8 

1280 

10240 

886 

9 

1232 

11091 

851 

10 

1191 

11913 

821 

As  with  the  unit  curve. 

there 

are  three  ways 

to  make  estimates  of  the 

costs 

of  a  lot  given  the  CA 

formulation— one  may 

use 

a  graph,  an  electronic 

calculator,  or  a  computer  program. 

EXAMPLE  14-3: 

A 

very  new  ultra  modern  fighter  airframe  has 

the 

following  cost  pattern. 

Learning  Curve 

Cost 

Form 

Slope 

First  Unit  Cost 

Mfg  Labor 

Cum  Av 

82% 

1,300  thousand  hrs 

Dir.  Material 

Cum  Av 

93% 

5,215  thousand  198X  dollars 

Engineering 

Cum  Av 

77% 

228  thousand  hrs 

Tooling 

Cum  Av 

75% 

325  thousand  hrs 

Estimate  cost  of  a  first  lot  of  25  airframes. 

The  relevant  exponents  (b's)  are: 

82% :b  -  log  .82/log  2  -  -0.286304 
93%:b  -  log  .93/log  2  -  -0.104697 
77%:b  -  log  .77/log  7  -  -0.377070 
75%:b  -  log  .75/log  2  -  -0.415038 
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Using  an  electronic  calculator  one  estimates  the  cost  of  the  25  airframes 
as: 


Direct  Labor  =  (13001  (25  9  .286304^ ^5}  thousand  hours 
=  12,931  thousand  hours 

Direct  Material  =  (5215) (25~° '104697) (25)  thousand  198X  dollars 
=  93,075  thousand  198X  dollars 

Engineering  =  (228) (2599’ 97797°) (25)  thousand  hours 
»  1693  thousand  hours 

Tooling  =  (325) (25  0 ‘4^503®)  (25)  thousand  hours 
*  2136  thousand  hours 

To  graph  these  curves  one  uses  exactly  the  same  technique  as  with  the 
unit  curve  but  estimates  of  cost  are  made  differently. 

To  graph  the  Manufacturing  Labor  Curve  get  some  points  on  the  curve; 
how  many  is  a  matter  of  choice  but  X  as  large  as  64  or  128  (depending  on 
the  size  of  the  paper)  is  desirable. 

Cumulative  Average  Curve 
for  Manufacturing  Labor  From  Example  Three 


Cumulative  Average  Cost 


Unit 

(thousands  of  hours) 

Source 

1 

1300 

Given 

- 

2 

1066 

82% 

of  value 

for 

unit 

1 

4 

874 

82% 

of  value 

for 

unit 

2 

8 

717 

82% 

of  value 

for 

unit 

4 

16 

588 

82% 

of  value 

for 

unit 

8 

32 

482 

82% 

of  value 

for 

unit 

16 

64 

395 

82% 

of  value 

for 

unit 

32 

Figure  14-11  shows  the  learning  curve  for  manufacturing  labor  and  direct 
materials.  Figure  14-12  is  the  plot  of  the  learning  curve  for  engineering 
labor  and  tooling  hours.  Using  the  plots  to  estimate  the  costs  for  this 
lot  of  25  units  we  have: 

Manufacturing  labor  ■  (518)  (25)  ■  12,950,000  hours 
Direct  materials  ■  (3625) (25)  ■  90,625,000  dollars 
Engit.  «.ring  labor  *  (67.5)  (25)  »  1,687,500  hours 
Tooling  *  (85.8) (25)  =  2,145,000  hours 

As  in  other  cases  the  results  from  graphing  and  calculator  operations  are 
very  close. 


mulative  Average  Learning 
rves  For  Engineering  Labor 
d  Tooling  From  Example  14-3 
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If  you  wish  to  estimate  the  cost  of  a  lot  other  than  the  first  lot, 
you  must  remember  that  cumulative  average  times  the  number  of  units  gives 
total  cost  so  you  must  subtract  the  cost  of  the  previous  lots. 

For  example,  if  we  wish  to  estimate  the  cost  of  a  lot  of  25  units,  (say 
units  76-100)  using  an  electronic  calculator: 

TC76-100  =  A  [<100><l00b>  ~  (75) (75b) ] 

Direct  labor  -  1300 [(100) (100"° *286304)  -  (75) (75-0*286304) ] 

*  1300(26.754202  -  21.788319) 

=  6,456,000  hours 

Direct  Material  -  5215[(100) (100-0 *104697)  _  (75) (75~° *104697) ] 

»  5215(61.745598  -  47.725224) 

-  73,116,000  198X  dollars 

Engineering  Labor  -  228 [ (100) (100-0 ' 3770 70)  -  (75) (75“°‘377070) ] 

-  228(17.614081  -  14.724207) 

-  659,000  hours 

In  each  case  we  calculate  the  cost  of  100  units  and  subtract  the  cost 
of  75  units  to  get  the  cost  of  the  lot. 

Using  graphs  we  read  the  value  at  unit  100  which  gives  the  cumulative 
average  at  unit  100,  multiply  by  100  to  get  the  total  cost  of  100  units  and 
subtract  the  reading  at  unit  75  times  75. 

Using  Figure  14-11  and  14-12  we  get: 

Manufacturing  Labor  -[(345)  (100)  -  (376) (75) ] 

-  6,300,000  hours 

Direct  Material  -[(3190)  (100)  -  (3300) (75)] 

-  71,500,000  198X  dollars 

Engineering  Labor  -[(39.8)  (100)  -  (44.1) (75)] 

-  672,000  hours 

Tooling  -[(47.3)  (100)  -  (53.7K75)] 

-  702,000  hours 


Graphs  give  slightly  different  answers  but  as  am  estimate  they  should 
be  very  close  to  those  answers  calculated.  Graphs  are  the  simplest  to  use 
amd  they  cost  less  tham  a  sophisticated  electronic  calculator. 

Learning  curves  cam  be  useful  tools  for  estimating  direct  costs.  In 
spite  of  all  the  technological  advances  that  have  oc cured,  a  piece  of  graph 
paper  remains  a  powerful  amd  relatively  easy  tool  to  use  in  making  these 
estimates . 

Alternatives  to  graph  paper,  the  modern  electronic  calculator  amd  the 
computer,  are  nice  but  unnecessary  amd  paper  is  much  cheaper.  Calculators 
amd  computers  provide  a  lot  of  digits  but  the  amalyst  must  remember  estimates 
are  no  better  tham  the  data  on  which  they  aure  based  and  a  few  digits  are 
usually  all  we  cam  trust. 

Anybody  cam  get  a  few  packages  of  log-log  paper  amd  happily  join  the 
estimating  community.  Why  don't  you? 


CHAPTER  15 
ANALYSIS  OF  VARIANCE 


INTRODUCTION 

In  Chapter  8  hypothesis  tests  were  made  on  the  mean  of  one  population,  and 
on  the  difference  of  means  of  two  populations.  This  chapter  discusses  a  tech¬ 
nique  for  comparing  r  means  and  is  called  analysis  of  variance  (ANOVA) .  The 
hypotheses  to  be  tested  are: 

HoJH  «  y  ■  y  m  ...  m  u 

12  3  r 


H  :  y  +  y  j*  y  y„ 

112  3  r 


If  the  null  hypothesis  is  rejected,  the  conclusion  is  that  at  least  one  pair 
of  means  are  significantly  different,  i.e.,  they  come  from  different  popula¬ 
tions. 

Recall  that  for  testing  one  and  two  means ,  the  t  test  was  used.  The 
t  test  could  be  used  to  test  r  means  but  the  process  would  become  cumbersome 
because  each  possible  pair  of  means  would  have  to  be  tested  separately.  For 
exanqple ,  an  experiment  or  process  calling  for  the  comparison  of  five  means 
would  require  ten  separate  t  tests  (you  haven't  forgotten  that  5  objects  taken 
2  at  a  time  is  10) . 

ONE-t ACTOR  ANALYSIS  OF  VARIANCE 

In  testing  hypotheses  on  the  difference  between  means 


H  : 
1 


y  +  y 
1  2 


the  null  hypothesis  is  rejected  when  the  critical  difference  between  sample 
means  (X^-  X2)  is  greater  than 


±  1*2/  <*•*•  ■  n 

l  2 
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and  a2  and  o2  are  unknown  but  equal  (see  Case  3,  page  8-21.  In  order  to 

perform  the  foregoing  test  an  estimate  of  the  population  variance  was  needed. 
This  was  found  as 


I 

I 

| 

I 

I 


I 


h 


where  a2  is  the  estimator  for  a2  which  is  the  variance  of  the  two  papulations 
in  question  (assumed  to  be  equal) ,  and 

s2  .  (nl  ~  1}  S1  +  (n2  ~  1}  8| 

8p  "  nx  +  n2  -  2 

In  the  test  of  hypotheses  on  r  means,  the  F  distribution  is  used,  which  is 


2 


Both  numerator  and  denominator  zzz  estimates  of  the  population  variance  a2. 

In  this  case,  however,  the  denominator  is  referred  to  as  a  pooled  variance  and 
is  not  the  same  s2  used  in  Case  3  of  the  t  test,  The  pooled  variance  can  be 
found  by : 


(nx  -  1)  s2  +  (n2  -  1)  s2 

JL  +  _L 

n,  +  n_  -2 

1  2 

m  m 

n!  n2 

2 

It  can  be  readily  be  seen  that  ns-  is  also  an  estimator  of  the  population 

3C  2 

variance.  Since  s 2  is  an  estimator  of  c2  and  cr|  •  #  then  a2  »  a*  •  n  which 

is  estimated  by  n  •  si.  AS  will  be  demonstrated  later,  the  actual  confutations 

x  nsS 

for  the  test  statistic  F  -  is  made  easier  (and  more  meaningful)  through 

P 

the  use  of  stubs  of  squares. 

Single  Factor  AMOVA  Model.  A  factor  is  an  independent  variable  which  has  been 
isolated  to  determine  its  effect  on  a  specified  dependent  variable.  For 
example,  a  maintenance  officer  might  be  interested  in  determining  the  effect 
of  maintenance  technician  skill  levels  (factor)  on  the  amount  of  time  needed 
to  complete  a  job.  in  this  case,  the  single  factor,  skill  level,  would  con¬ 
sist  of  several  factor  levels,  e.g.,  3,  5,  7,  etc.  The  dependent  variable, 
time  to  complete  a  job,  may  be  "dependent"  upon  the  skill  level  of  the  tech¬ 
nician  doing  the  work. 
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The  linear  equation  for  the  single  factor  ANOVA  model  Is  given  by: 


e. .  »  random  effect,  the  difference  in  X±j  values  not 
^  accounted  for  by  difference  in  skill  level 

where  e^  •  x±j  - 

Assumptions  for  the  ANOVA  model  are  made  on  this  random  component;  for 
statistical  tests  using  this  model  to  be  valid,  the  e^  must  be: 

1 .  Independent 

2.  Normally  distributed  for  each  treatment  (row)  subpopulation. 

3.  E(ei;J)  -  0 

4.  Var  ■  v2  (homogeneity) 

ANOVA  Table.  In  our  exaaple,  the  maintenance  officer  recognizes  that  the 
amount  of  time  to  complete  a  specific  job  fluctuates  each  time  it  is  performed 
and  wishes  to  isolate  the  reasons  for  the  differences  in  completion  times. 

The  ANOVA  table  below  is  a  convenient  way  to  show  the  sources  of  fluctuation 
(called  variation  in  ANOVA) . 


Table 


ANOVA  Table 


Source 

Sum  of 
Squares  (SS) 

Degrees  of 
freedom  (df) 

Mean  sum  of 
Squares  (MSS) 

F 

SS 

Explained 

SS 

r 

r-1 

-2 

X 

(Between  rows) 

(MSSr) 

Unexplained 

(0 

•w 

r(n-l) 

2  SS 

s*  ■  e 

p  FET-u 

(Within  rows 

(MSS  ) 

or  error) 

o 

Total 

CO 

CO 

ft 

nr-1 

The  sources  of  variation  are  best  Illustrated  by  the  equation  using  sums  of 
squares: 

+  l  l  (X  -  X)2 

i  j 

unexplained 

variation 

(SS  ) 
e 


where, 

X  •  overall  sample  mean  found  by 


l  1  j  *  x) 2  - 


i  j 


n  l  (X.  -  X): 
i  1 


total 

variation 

(SSt> 


explained 

variation 

(SS  ) 
r 
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The  total  variation  represents  differences  between  each  observed  value  and  the 
mean  of  all  values.  This  total  variation  has  two  components .  The  explained 
variation  component  represents  differences  between  the  mean  of  each  factor 
level  (row)  and  the  overall  mean.  This  component  or  source  of  variation 
isolates  the  portion  of  the  total  variation  which  is  explained  or  accounted 
for  by  the  factor  levels,  i.e. ,  how  much  of  the  total  variation  in  conflation 
tine  is  explained  by  the  fact  that  technicians  of  different  skill  levels 
accomplish  the  job. 


The  remaining  source  of  variation,  unexplained,  is  the  portion  of  total 
variation  which  has  not  been  explained  by  the  differences  of  factor  levels. 
This  variation  is  said  to  be  due  to  chance  (random)  or  the  result  of  other 
factors  not  identified  in  the  model. 


After  the  sources  of  variation  have  been  isolated,  an  F-value  is  obtained 

by: 


explained  variance 
unexplained  variance 


SS 
_ r 

r-1 

SSe 

r(n-l) 


MSS 
_ r 

MSS 

e 


A  large  value  of  F  means  that  the  explained  variance  is  much  larger  than  the 
random  variance  and  that  the  hypothesis  of  no  difference,  i.e.. 


H, 


o  ■ 


P 


r 


should  be  rejected. 

EXAMPLE  15-1.  Three  road  bedding  machines  are  being  tested  by  the  Base  civil 
Engineer  who  is  interested  in  knowing  whether  the  average  output  differs  among 
the  machines .  Output  is  measured  as  linear  feet  of  roadbed  produced  per  hour. 
The  hypotheses  to  be  tested  would  be: 


v 


*i’  *  "1  *  “2  *  “3 

The  test  will  be  conducted  at  a  significance  level  of  a  *  0.05.  A  sample  of 
five  hours'  output  is  obtained  for  each  of  the  machines.  Table  15-2 
shows  the  actual  data  in  linear  feet  per  hour. 
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MACHINE  HOOKS 


Machinal'S. 

Hours  j-1 

2 

3 

5 

i-1 

54.6 

45.7 

56.7 

37.7 

48.3 

2 

53.4 

57.5 

54.3 

52.3 

64.5 

3 

56.7 

44.7 

50.6 

56.5 

49.5 

Die  component*  of  variation  (in  output)  are  computed  below  and  placed  in 
the  ANOVA  table  fora  for  analysis. 

r 


s& 

1 


n  nr 


r 

where  J  x2 
i-1 


X?. 


[total  of  row  i]2  -  J  4  J  X  V 

i-1  vVi  ijj 


/  r  n  \2 

[total  of  all  observations 32  -  I  I  £  x  I 

\i-l  j-1  13  / 


n  -  number  of  observations  in  each  row  (assumes  row  sample 

sixes  are  equal) 

r  «  nvaaber  of  rows 


SS_  -225^137.61^89  .  158.4 


ss 


t 


l  l 


i-1  j-1 


X?. 


613,089 


41,456.4 


583.8 


To  test  the  hypothesis  that  the  row  means  are  equal,  a  critical  value  of 
F  (Fc)  is  obtained  from  a  table  of  critical  values  of  the  F  statistic  for 

FrT^  i »  or  F*  __  which  is  4.10 

r(n-l) ,  a  12,  a  ■  .05 

Since  F  }  F  ,  Hq  cannot  be  rejected  and  it  must  be  concluded  that  the  average 
outputs  of  She  three  machines  are  approximately  equal. 

Unequal  Sample  Sizes.  The  ANOVA  Table  can  be  constructed  for  cases  in  which 
the  sample  sizes  are  not  equal  as  follows : 


1 


lained  Z  (l  x l  l 

(Between  rows)  y  \i»l  1-1  y 

n.  r 

I»i 
i-1  * 


Jnexplained 
(Within  rows) 


SS,.  -  SS 
t  r 


r  n. 


Z  I  ^ 

i-1  j-1  3 


/  r  ni 

I  Z> 

u°i  j-i 

I  ni 

i»l 


£  (n.-l)  - 5 - 

. L.  i  r 

Z  (n i-D 

i-1 


Z 

i-1 


Multiple  Comparisons .  The  overall  F-test  for  the  model  is  a  test  to  determine 
whether  there  is  at  least  one  pair  of  means  which  are  significantly  different. 

To  isolate  which  of  the  means  are  statistically  different  ,  a  method  of  con¬ 
trasting  should  be  used.  The  method  of  H.  Scheffe  presented  below  is  the  most 
conservative  of  the  a  posteriori  approaches  to  comparing  means.  Other  methods 
such  as  Duncan's  multiple  range  test,  Tukey,  and  least  significant  difference 
(LSD)  can  be  found  in  statistical  applications  texts. 

The  Scheffe  method  of  contrasting  is  useful  particularly  when  the  researcher 
has  not  decided  a  priori  the  exact  nature  of  contrasting  to  be  done  prior  to 
obtaining  the  experimental  data. 

Pairwise  comparisons  are  found  by: 


W1  ‘  U2 


(Xx  -  S2)  ± 


SSL,  •  fh  •  1 H 1  • 


where 


X  -  X  -  difference  in  means  of  samples  (rows)  1  and  2  being  tested 
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V — ^ - - 


p(r-l)  «  tabled  F  value  for  confidence  le”el  of  1  -  a  and  r  -  1 
a,n(r-l)  degrees  of  freedom  in  the  numerator,  and  n(r-l)  degrees 
of  free  (tom  in  the  denominator. 


MSS 

e 

C 


•  mean  sum  of  squares  (error)  from  the  ANOVA  table 

*  contrasting  coefficient.  In  the  equation  above, 
Cx  »  1  and  C2  -  -1.  I  CL  =  0. 


In  the  example  above,  the  average  output  of  machines  1  and  2  using  a  >  .05 
would  be : 


yl  "  W2  “  (48*6  '  56*4) 


-  -7.8  ±  10.55 

The  95  percent  confidence  interval  thus  constructed  is : 

-18.35  ^  u1  ~  2.75 

The  relevant  hypotheses  which  are  tested  in  this  example  are: 

V  yl  •  y2  "  0 


Since  the  confidence  interval  on  contains  the  value  zero,  the  null 

hypothesis  cannot  be  rejected,  i.e.,  there  is  no  significant  difference  in  the 

average  outputs  of  roadbed  machines  1  and  2.  This  is  to  be  expected  since  the 
overall  F-test  resulted  in  nonrejection  of  H0:  ■  U2  “  W3.  It  should  be 

noted  that  by  using  Scheffe's  technique  for  comparison,  any  number  of  means  may 
be  compared  simultaneously  without  any  loss  in  the  level  of  significance. 

TWO-FACTOR  ANALYSIS  OF  VARIANCE 


TVo  factor  ANOVA  enables  the  experimenter  to  identify,  isolate,  and  test 
an  additional  source  of  variation.  In  the  previous  example,  it  was  found  that 
the  differences  in  the  outputs  of  the  machines  were  not  due  to  differences  in 
the  machines  themselves,  i.e.,  the  machines  factor  was  not  a  significant  source 
of  the  variation  in  output.  However,  perhaps  the  machine  operators ,  a  second 
factor,  may  have  an  effect  on  machine  output. 
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Vi 


Extending  the  example,  suppose  there  were  five  operators,  each  having 
worked  the  three  machines.  The  data  table  is  presented  as  before,  this  time 
considering  the  five  operators. 

Table  15-5 


Machine'X. 

Operator  j  0  1 

2 

3 

3 

5 

i  -  1 

54.6 

45.7 

56.7 

37 

.7 

48.3 

2 

53.4 

57.5 

54.3 

52 

.3 

64.5 

3 

56.7 

44.7 

50.6 

56 

.5 

49.5 

Hie  two  factor  ANOVA  model  is: 


X.  .  *  u  +  R.  +  C.  +  e.  . 

JO  i  3  13 


where 


X„  «  value  of  the  dependent  variable  being  measured 


overall  mean 

row  (treatment)  effect 


i 

l  R.  -  0 
L  x 


y.  -  y 


»  Column  (block) effect 


ci  ■  “j  -  “ 


l  °j  ■ 0 


e.  ,  ■  random  effect;  the  difference  in  X  .  not  accounted  for  by 


differences  in  row  and  column 


il  “7 

observations. 
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The  assumptions  for  the  two-factor  model  are  the  same  as  those  for  the 
one-factor  ANOVA  and  will  not  be  repeated  here.  The  two-way  ANOVA  Table 
is  presented  below: 

Table  15-6 
TWO-WAY  ANOVA  TABLE 


Source 

SS 

Explained  (Between 

SS 

rows,  or  treatment) 

Explained  (Between 

SS 

columns,  or  block) 

Unexplained 

(error) 

SS 

e 

Total 

SS. 

t 

MSS 

F 

SS 

MSS 

r 

-  *  Hdb 

r-1  r 

MSS 

e 

SS 

MSS 

c 

c-1  c 

MSS 

e 

(r-1) (c-1)  SS_ 


(r-1) (c-1)  “  MSSe 


Similar  to  the  one-factor  model,  the  sums  of  squares  are  obtained  by: 

l  l  <X  -X)2  =  c  l  (X .  -X)2  +  r  l  (5  -X)2  +  l  l  (X  -X  -X  +  X)2 
i-1  j-1  13  i-1  j-1  °  i-1  j-1  3  ° 


Total  Bow  Column 

Variation  -  Variation  +  Variation  + 


Unexplained 

Variation 

SS 


SS.  and  SS  are  found  the  same  as  in  the  one-factor  model.  SS  is 
t  r  e 

best  calculated  by: 

SS  -  SS.  -  SS  -  SS 
e  t  r  c 


where 


c 

I  X2 


.j  X 


X2.  *  [column  totals]2  -  £  [  £  X,  T 

°  j-1  Vi-1  i7 


EXAMPLE  15-2.  Continuing  the  illustration  of  the  roadbedding  equipment, 
the  following  ANOVA  table  for  the  two-way  classification  is  constructed: 

Table  15-7 


ANOVA  TABLE 


Source 

SS 

df 

MSS 

F 

Between  rows 

154.8 

2 

77.4 

13.1 

Between  columns 

381.7 

4 

95.4 

16.2 

Unexplained 

47.3 

8 

5.9 

Total 

583.8 

14 

We  can  now  test  whether  there  is  a  discernible  difference  in  machines 
and/or  operators  in  the  presence  of  each  other.  In  other  words,  while 
testing  for  differences  in  machines,  we  are  taking  into  account  the  dif¬ 
ferences  in  operators. 

To  test  for  differences  in  machines : 


MSS 

F  -  - £■ 

MSS 

e 


77.4 

5.9 


13.1 


The  tabled  value  for  F2  -  4.46  and  the  hypothesis  of  no  dif- 

8, a  3  .05 

ference  in  machines  is  rejected.  Thusrin  the  presence  of  the  operators, 
there  is_  a  significant  difference  in  the  average  output  of  the  machines. 


TO  test  for  differences  in  operators: 


F 


MSS 
_ c 

MSS 


e 


95.4 

5.9 


16.2 


Obviously,  the  hypothesis  of  no  difference  in  operators  can  also  be 
rejected.  Thus,  there  is_  a  significant  difference  in  operators  in  the 
presence  of  the  machines. 


Multiple  Comparisons .  in  a  manner  similar  to  that  shown  in  the  one  factor 
ANOVA  model,  the  H.  Scheffe  method  of  contrasting  cam  be  done  with  column 
(block)  effects  as  well  as  row  (treatment)  effects.  The  Scheffe  confidence 
interval  is  given  by: 


column  means 


The  results  are  Interpreted  the  same — if  the  conficence  interval  contains 
zero,  there  is  no  statistical  difference  between  the  pair  of  means. 


"Collapsing"  the  ANOVA  Table.  After  the  F-tests  on  treatment  and  blocking 
effects  have  been  performed  and  it  is  found  that  one  of  the  factors  is  not 
significant,  i.e.,  the  null  hypotheses  cannot  be  rejected,  the  ANOVA  Table 
should  be  reconstructed  into  a  one-factor  classification.  For  example,  if 
the  blocking  effects  cure  not  significant,  the  explained  variation  (between 
columns)  should  be  added  to  the  residual  or  unexplained  variation.  The 
model  should  then  be  tested  as  a  one-factor  ANOVA  for  treatments  (row 
effects)  only.  This  is  commonly  referred  to  as  collapsing  the  ANOVA  Table 
and  is  illustrated  below: 

Table  15-8 
TWO-FACTOR  TABLE 


Source 

SS 

Between  rows 
(treatment) 

260 

Between  columns 
(Block) 

144 

Unexplained 
(Residual  or 
error) 

44 

Total 

448 

MSS 

F 

65 

14.8 

14.4 

3.3 

At  a  »  .01,  block  or  column  effects  cure  not  significant  and  should  be 
added  back  into  the  residual.  The  following  table  results. 

Table  15-9 


ONE-FACTOR  "COLLAPSED"  TABLE 


Source 


Within  rcws 

188 

18 

(Error  or 
residual) 

Total 


448 


22 


The  hypothesis  on  row  differences  should  then  be  retested  to  determine 
whether  it  remains  significant.  Note  that  in  the  collapsed  table,  the 
degrees  of  freedom  as  well  as  SSc  has  been  added  into  the  residual  effects. 
The  treatment  effects  (SSr)  and  total  variation  remain  the  same. 

OTHER  EXPERIMENTAL  DESIGNS 

Up  to  this  point,  we  have  been  considering  one  and  two  factor  fixed 
effects  ANOVA  models.  Fixed  effects  models  are  those  in  which  all  of  the 
levels  of  the  factor (s)  have  been  included  in  the  experiment.  In  the 
example  on  the  roadbedding  equipment,  we  had  to  assume  only  three  machines 
and  only  five  operators  were  available.  If  more  machines  and  operators 
were  available  and  only  three  and  five  had  been  selected  respectively,  the 
example  would  have  been  a  random  effects  model  and  the  calculations  for 
sources  of  variation  would  have  been  slightly  different. 

In  the  previous  models,  it  was  also  assumed  there  was  no  interaction 
among  the  main  factor  effects.  Such  interaction  would  occur,  for  example , 
if  some  of  the  operators  liked  a  particular  machine  and  others  disliked  it. 
To  examine  the  interaction  effects ,  more  than  one  observation  per  cell  is 
required.  Again  in  the  previous  example,  each  operator  worked  each  machine 
one  time  and  the  output  was  observed  and  recorded.  To  analyze  the  inter¬ 
actions,  each  operator  would  have  to  run  each  machine  several  times.  Die 
process  of  obtaining  more  than  one  observation  per  cell  is  called  replica¬ 
tion.  Die  advantage  of  replication  is  that  more  information  can  be 
obtained.  However,  this  advantage  must  be  weighed  against  the  cost  and 
accuracy  of  the  data  to  be  obtained. 

TWo-Factor  ANOVA  with  Interaction.  Die  equation  for  the  model  is  given  as: 

Xijk  ”  y  +  Ri  +  Cj  +  (RC)ij  +  eijk 


where 


ijk 


(RC) 


ij 


=ij* 


observed  value  of  the  dependent  variable 
row  effect 
column  effect 

effect  due  to  the  interaction  of  row  and  column  factors 
effect  due  to  random  fluctuation,  experimental  error. 


R^  and  c  are  called  main  effects  since  these  are  the  main  factors 
selected  for  isolation  in  the  experiment.  Hypotheses  are  tested  for  each 
of  the  main  effects  as  well  as  the  interaction  effect  to  determine 
statistical  significance.  The  assumptions  for  this  model  are  made  on  the 
random  error  term,  e^j^,  and  are  the  same  as  presented  earlier.  Die  ANOVA 
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Table  for  the  two- factor  ANOVA  with  interactions  is  given  below 


Table  15-10 


Source 


Main  effects 


(Residual) 


ss 

ss  - 

r 

n*c  l  (XL 

-  X)2 

• 

SS  « 
c 

n*r  l  a 

-X)2 

ss 

rc 

oil 

-s. 

1. . 

X  .  +  X) 

•  J  • 

2 

SS  - 

e 

^  <Xijk 

SS  ■ 

w 

III  «1Jk 

-  X)2 

*nrc-l 


*n  «  number  of  observations  per  cell,  i.e.,  replications. 


EXAMPLE  15-3.  A  DOD  repair  facility  is  interested  in  studying  the  effects 
of  several  factors  in  a  repair  process  which  machines  large  castings  on  a 
hand-controlled  lathe.  The  DOD  wants  to  know  if  the  type  of  cutting  tool  and 
angle  setting  of  the  tool  significantly  affect  the  amount  of  time  it  takes  to 
machine  a  casting.  TWo  types  of  cutting  tools  and  two  angle  settings  are 
used.  For  the  experiment  it  has  been  decided  that  two  castings  will  be 
lathed  for  each  combination  of  cutting  tool  and  angle  setting,  leaving 
2*2*2  or  8  castings  to  be  turned,  lhe  time  to  complete  each  casting  will  be 
recorded  in  minutes. 


In  order  that  the  experiment  remain  unbiased,  the  order  in  which  the 
castings  were  lathed  was  determined  by  a  coin  toss  with  the  following  results 


Table  15-11 


The  first  easting,  therefore,  was  using  Tool  Type  1,  set  at  cutting 
angle  B.  The  model  to  be  used  is: 

Xijk  "  V  +  T±  +  +  (TA)ij  +  e1^Je 

where 

«  cutting  time  in  minutes 

U  ■  true  average  cutting  time  for  the  process 

T,  »  the  effect  on  cutting  time  of  t^ol  type  i  with 

i  -  1,2 

Aj  *  effect  on  cutting  time  of  angle  setting  j  with  j  »  A,B 

(TA) ..  -  interaction  effect  of  tool  type  with  cutting  angle  on 

3  average  cutting  time 


e . ..  *  random  fluctuation;  since  there  are  two  replications, 

13  k  -  1,2 


Data  from  the  experiment  are  recorded  in  minutes  in  the  following 
table. 
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Table  15-12 


TOOL  TYPE  (T, ) 


Replication 


Angle  (A^) 


Angle  (A^ ) 


1 

n 

B 

Sum 

m 

B 

Sum 

29 

27 

56 

25 

32 

57 

2 

28 

23 

51 

26 

27 

53 

Sum  (A^) 

57 

50 

107 

51 

59 

110 

Sum  (T^) 

107 

110 

Total 


Using  the  data  from  the  table,  the  components  of  variation  are  com¬ 
pleted  from  the  following: 

t  -  hu*  -  svs&  of  all  observations  *  X. . . 

ijk  13 

n  -  number  of  replications 

r  *  number  of  rows 

c  '  *  number  of  columns 


I 

ijk  "ijk 


Interaction  sum  of  squares  can  be  obtained  as  the  remainder  by 


SS_„  «  SS,.  -  SS_  -  SS  -  SS 
rc  tree 


Table  15-13 
ANCVA  TABLE 


Source 

SS 

df 

MSS 

F 

Main  Effects 
Between  Tools 

1.12 

1 

1.12 

.2 

Between  Angles 

.12 

1 

.12 

.02 

Interactions 

28.13 

1 

28.13 

5.2 

Unexplained 

21.5 

4 

5.375 

Total 

50.87 

7 

The  critical  value  for  F^a  „  0>5  Is  7.71.  Therefore,  no  significant 
effect  on  cutting  time  has  been  found  due  to  tool  type,  cutting  angle,  or 
their  interaction.  If  the  test  were  mode  at  a  -  .01,  only  the  interaction 
would  have  been  significant. 

In  summary,  ANCVA  can  be  a  powerful  tool  in  isolating  the  sources  of 
variation  for  selected  measure able  variables.  The  processes  presented  in 
the  previous  pages  are  not  all-inclusive  of  the  wide  range  of  ANCVA  tech¬ 
niques.  The  reader  should  consult  texts  in  experimental  design  for  further 
reference. 


CHAPTER  16 


LINEAR  PROGRAMMING 


INTRODUCTION 

Linear  programming  (LP)  is  a  mathematical  technique  for  determining 
the  manner  in  which  limited  available  resources  can  be  allocated  among  com¬ 
peting  alternatives  in  such  a  way  as  to  optimize  a  particular  objective 
while  satisfying  a  number  of  constraints.  As  such,  LP  is  one  of  a  number 
of  management  science  methods  known  collectively  as  "constrained  optimiza¬ 
tion  techniques."  Because  the  problem  of  resource  allocation  is  common  to 
all  organizations  and,  to  varying  degrees,  an  activity  of  concern  to  man¬ 
agers  at  all  levels  of  responsibility.  If  is  one  of  the  most  widely  used 
management  science  techniques. 


>lications 


The  following  list,  which  is  by  no  means  exhaustive  nor  even  necessarily 
comprehensive,  provides  a  representative  inventory  of  prototype  situations 
to  which  LP  has  been  successfully  applied: 


1 )  Accounting 

2 )  Agricultural  Planning 

3)  Air  Pollution  Control 

4)  Airline  Scheduling 

5)  Assignment  and  Scheduling 

6 )  Capital  Budgeting 

7 )  Capital  Expansion 

8 )  Contract  Award 

9 )  Cargo  Loading 

10)  Diet/Menu  Planning 

1 1 )  Environmental  Protection 

12)  Equipment  Acquisition 

13)  Financial  Mix 

14)  Highway  Planning 

15)  Hospital  Administration 

16)  Inspection 

1 7 )  Material  Blending 

18)  Media  Mix  (Marketing) 

19)  Personnel/Manpower  Planning 

20 )  Plant  Location 

21)  Police  Patrol  Sector  Planning 

22)  Portfolio  Selection 

23)  Product  Mix 

24)  Production  Scheduling 

25)  Production-Vendor  Selection 


26)  Project  Management 

27)  Public  Works  Management 

28)  Regional  Planning 

29)  Solid  Waste  Management 

30)  Structural  Analysis 

31)  Traffic  Flow  Analysis 

32)  Transportation-Distribution 

33)  Trim  Loss 

34)  Water  Resource  Management 

35)  Welfare  Allocation 

Characteristics  and  Assumptions 

Linear  Programming  can,  in  general,  be  applied  to  any  decision  situation 
having  the  following  characteristics: 

1.  A  limited  quantity  of  economic  resources  (e.g.,  labor,  capital, 
time,  equipment,  material,  etc*)  is  available  for  allocation. 

2.  The  resources  are  used  in,  or  have  utility  for,  the  production  of 
goods  and/or  services. 

3.  There  are  alternative  courses  of  action,  i.e.,  there  are  two  or 
more  ways  in  which  the  resources  can  be  used.  Bach  is  called  a  solution  or 
a  program. 

4.  There  is  a  clearly  identifiable  criterion  by  which  alternative 
programs  can  be  compared  (e.g.,  cost,  profit,  etc*).  Each  product  or  ser¬ 
vice  to  which  resources  are  allocated  yields  a  return  or  makes  some  contri¬ 
bution  in  terms  of  the  stated  criterion. 


5.  The  allocation  is  usually  restrained  by  several  limitations  or 
requirements  termed  constraints . 

6*  It  is  possible  to  describe  the  decision  situation  as  a  mathematical 
model,  i.e.,  the  criterion  and  constraints  can  be  expressed  as  (linear) 
mathematical  equalities  and/or  inequalities. 

In  addition  to  meeting  these  characteristic  conditions,  decision  situa¬ 
tions  for  which  LP  is  appropriate  must  satisfy  the  following  basic  assumptions 

1 .  Certainty .  All  data  concerning  the  decision  situation  is  assumed 
to  be  known  with  certainty,  i.e.,  the  model  is  deterministic. 

2.  Proportionality.  All  relationships  (equalities  and/or  inequalities) 
in  the  model  are  linear,  i.e.,  the  criterion  and  constraints  change  propor¬ 
tionally  to  the  level  of  each  activity. 

3.  Additivity.  The  total  utilization  of  each  resource  is  determined 
by  adding  together  that  portion  of  the  resource  required  for  the  production 


of  each  of  the  various  products  or  activities.  Similarly,  the  net  change 
in  the  criterion  value  is  the  sum  of  the  individual  contributions  associated 
with  each  of  the  products  or  activities .  The  assumption  of  proportionality 
guarantees  linearity  if  and  only  if  joint  effects  or  interactions  are  non¬ 
existent  . 

4.  Divisibility.  The  variables  in  the  model  are  continuous  and  can 
assume  fractional  values.  Divisibility  means  that  a  fractional  value  of 
the  decision  variables  makes  sense,  and  the  activities  are  not  restricted 
to  whole  numbers. 

5.  Nonnegativity .  Negative  activity  levels  (i.e.,  negative  alloca¬ 
tions  are  not  permissible*  All  variables  must  assume  nonnegative  values. 

6.  Independence.  Complete  independence  of  coefficients  is  assumed,  both 
among  activities  and  among  resources.  For  example,  the  cost  of  providing 
one  unit  of  service  A  has  no  effect  on  providing  one  unit  of  service  B. 

MODEL  FORMULATION 


Mathematical  Modeling 

A  model  is,  in  general,  a  simplified  abstraction  or  representation  of 
some  real  system  or  phenomenon  in  which  an  individual  is  interested  and 
about  which  that  person  wants  to  make  a  decision.  A  mathematical  model  is 
a  system  of  mathematical  equalities  and/or  inequalities  that  represent  the 
decision  situation  of  interest.  As  previously  noted,  an  LP  model  is  a 
mathematical  model  in  which  the  component  variables  are  continuous  and  non¬ 
negative,  the  model  parameters  or  constants  are  assumed  to  be  known  with 
certainty  (i.e.,  they  are  deterministic),  and  all  relationships  linking  the 
variables  am  d/or  constants  are  linear,  i.e.,  they  are  proportional  and 
additive. 

Formulating  an  LP  Model 

In  formulating  an  LP  model  to  help  improve  decision-making  effective¬ 
ness,  it  is  helpful  to  use  some  type  of  conceptual  framework  to  analyze 
the  decision  situation.  One  simple  and  convenient  framework  is  to  think 
about  the  problem  as  a  system,  i.e.,  as  a  set  of  interrelated  and 
interacting  components.  Therefore,  to  describe  any  decision  system,  we 
need  to  identify: 

1.  the  components  in  the  system  (i.e.,  the  variables  and  constants)  and 

2.  the  relationships  linking  or  existing  among  the  components  of  the 
system. 

System  Components: 


The  first  step  in  the  analysis  of  any  decision  situation  is  so  basic 
that  it  is  often  overlooked.  Who  is  the  manager  or  decision  taker  relative 


to  the  problem  at  hand?  In  many  cases,  the  analyst  and  manager/ decision 
taker  are  the  same*  In  other  situations,  however,  the  analyst  acts  In  a 
staff  role  relative  to  the  manager  for  whom  the  analysis  Is  being  per¬ 
formed.  This  determination  Is  a  vital  prerequisite  to  constructing  a  deci¬ 
sion  system  model  that  accurately  and  completely  captures  the  values  and 
range  of  Influence  or  control  of  the  manager. 

In  any  decision  situation,  there  Is  typically  (at  least)  one  variable 
that  Is  the  primary  focus  of  the  manager's  attention.  It  Is  the  thing  that 
he  or  she  wants  to  control.  It  Is  a  system  quality  or  characteristic  In 
which  the  decision  taker  places  some  value.  It  Is  also  a  yardstick  or  cri¬ 
terion  by  which  managers  judge  the  performance  or  health  of  the  real  system 
with  which  they  are  concerned  and  responsible.  In  an  LP  decision  system 
model,  this  component  is  commonly  referred  to  as  the  criterion  variable  and 
is  most  often  denoted  by  the  letter  "Z" .  In  private  enterprise,  profit, 
sales,  return  on  Investment,  market  share,  customer  satisfaction,  and 
employee  turnover  are  examples  of  criterion  variables  commonly  of  Interest 
to  managers.  In  the  public  sector,  cost,  productivity,  and  quality  of  ser¬ 
vice  are  frequently  used  criteria.  Within  the  Air  Force,  costs,  opera¬ 
tional  readiness  rates,  fill  rates,  turnaround  times,  system  reliability, 
and  employee  satisfaction  are  familiar  criterion  variables. 

The  objective  in  any  decision  situation  is  to  adopt  a  policy  or  imple¬ 
ment  a  management  action  that  leads  to  a  satisfactory  level  of  ( system) 
performance  as  measured  by  the  criterion  variable.  In  general,  objectives 
are  of  two  basic  types :  ( 1 )  satisfaction  objectives  and  ( 2 )  optimization 

objectives.  The  satisfaction  objective  is  one  for  which  the  decision  taker 
specifies  a  value  of  the  criterion  variable  that  is  "good  enough."  If  a 
system  manager  specifies  that  the  system  of  interest  should  have  a  relia¬ 
bility  of  at  least  0.90,  the  manager  has  determined  a  level  or  value  of  the 
criterion  variable  (reliability)  that  is  satisfactory  or  good  enough  for 
all  conditions  in  which  that  system  will  exist  and  operate.  In  other 
situations,  the  decision  taker's  objective  is  to  optimize,  i.e.,  maximize 
or  minimize,  the  value  of  the  criterion  variable  subject  to  a  set  of  con¬ 
ditions,  limitations,  or  constraints  that  must  be  satisfied.  Maximizing 
system  reliability,  subject  to  constraints  of  cost,  time,  and  available 
technology,  is  a  familiar  optimization  objective.  The  objective  of  any  LP 
model  is  to  optimize  the  value  of  the  criterion  variable,  subject  to  a  set 
of  constraints  or  limiting  conditions.  Using  standard  LP  notation,  the 
objective  can  be  expressed  as: 

maximize  Z,  or 

minimize  Z 


Having  determined  the  decision  taker's  criterion  and  associated 
(optimization)  objective,  the  next  step  in  building  the  LP  model  is  to 
complete  the  set  of  system  components.  For  each  candidate  variable  con¬ 
sidered  for  inclusion  in  the  system,  the  analyst  must  pose  the  question, 
"Does  it  significantly  affect  or  influence  the  criterion  variable?”  If  the 
answer  to  this  question  is  "yes”  the  variable  is  included.  If  the  answer 
is  "no"  the  variable  is  excluded.  Obviously,  the  process  of  both  nominating 
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and  discriminating  between  relevant  and  irrelevant  variables  is  very 
subjective,  being  greatly  affected  by  the  analyst’s  education,  experience, 
values,  insight,  creativity,  and  Judgment. 

For  each  component,  variable,  or  constant  included  in  the  system  model 
set,  the  analyst  must  next  ask,  "Can  the  manager  influence  or  change  the 
value  of  this  component,  i.e.,  can  the  manager  do  anything  about  it?"  If 

the  answer  is  "yes,"  the  variable  is  a  resource.  If  the  answer  is  "no,"  it 

is  an  environmental  factor.  Resource  variables  are  often  referred  to  as 
decision  variables  or  management  action  variables,  reflecting  the  capa¬ 
bility  of  the  manager  to  manipulate  or  influence  these  factors.  These 

variables  are  denoted  by  the  symbol  "Xj"  where  the  subscript  "j"  is  an 

index  used  to  label  the  variables  (j  =  1,2,..., n).  Environmental  com¬ 
ponents  in  a  system  generally  take  one  of  two  basic  forms.  First,  they  may 
be  variables  which  operate  more  or  less  independently  of  the  manager's 
control.  For  example,  the  demand  for  a  particular  item  stocked  by  Base 
Supply  is  a  variable  over  which  Base  Supply  management  has  no  control  and 
is,  therefore,  part  of  their  environment.  The  second  form  in  which 
environmental  components  may  be  incorporated  into  a  decision  system  model 
is  as  the  domain  boundary  for  a  decision  on  resource  variable.  Most  man¬ 
agers,  for  example,  must  conduct  their  operations  and  activities  within 
budget  or  financial  constraints  imposed  by  other  individuals  or  agencies. 
Consequently,  while  the  amount  of  money  to  be  allocated  by  the  manager 
between  competing  alternatives  is  a  decision  variable  within  the  manager's 
control,  the  total  amount  to  be  allocated  is  not  within  the  manager's 
control,  and  is,  therefore,  aun  environmental  factor.  In  general,  when  amy 
variable  is  bounded  in  its  domain,  those  limiting  variable  values  define 
the  boundary  of  the  system  and  its  environment. 

System  Relationships 

A  system  is  more  tham  a  set  or  collection  of  two  or  more  elements. 

A  system  is  a  set  of  two  or  more  components  that  are,  in  some  fashion, 
linked  by  one  or  more  relationships.  The  relationships  in  a  system  can  be 
described  in  a  number  of  ways.  First,  relationships  tend  to  be 
"directional"  in  nature,  i.e.,  changes  in  one  variable  produce  or  induce 
changes  in  others: 


Figure  16-1 
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In  the  simplest  two-component  system  illustrated  in  Figure  16-1,  a  change 
in  variable  Xj.,  induces  some  change  in  variable  X2.  In  this  example,  X2 
is  said  to  be  dependent  (on  changes  in  X^),  while  X^  is  independent  (of 
changes  in  X2) •  In  functional  notation,  this  expression  can  be  denoted  as: 

X2  *  f (Xx> 
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System  relationships  can  also  be  characterized  as  either  direct  or 
inverse .  In  a  direct  relationship,  a  variation  in  the  independent  variable 
(i.e.,  either  increase  or  decrease)  results  in  a  change  in  the  dependent 
variable  in  the  same  direction.  In  an  inverse  relationship,  a  variation  in 
a  particular  direction  by  the  independent  variable  leads  to  a  change  in  the 
opposite  direction  by  the  dependent  variable.  Symbolically,  a  direct  rela¬ 
tionship  is  conventionally  denoted  by  (+)  while  an  inverse  relationship  is 
indicated  by  ( - ) . 


Figure  16-2 


Figure  16-3 


To  complete  the  description  of  a  relationship,  it  is  necessary  to 
specify  or  describe  the  exact  character  of  the  link,  ideally,  through  some 
type  of  mathematical  expression  (equality  or  inequality)  that  describes 
best  the  behavior  of  the  dependent  variable  in  response  to  changes  of  a  spec¬ 
ified  magnitude  (and/or  rate)  in  the  independent  variable.  These  rela¬ 
tionships  may  be  continuous  and  linear  or  curvilinear  (e.g.,  exponential, 
logarithmic,  etc.)  or  they  may  be  less  "well-behaved,"  having  skips,  spikes, 
or  other  irregularities.  In  LP,  as  the  name  implies,  all  relationships  are 
assumed  to  be  linear.  While  the  assumption  of  linear  relationships  is  often 
a  gross  oversimplification  of  the  real  system  of  interest,  it  has  the  advan¬ 
tage  of  facilitating  the  development  of  an  analytical  solution  or  policy. 

In  an  LP  model,  there  are  two  basic  types  of  relationships:  (1)  the 
objective  or  criterion  function  and  (2)  system  (boundary)  constraints.  The 
objective  (criterion)  function  expresses  a  linear  relationship  between  the 
dependent  criterion  variable  (Z)  and  the  various  independent  resource 
(decision)  variables  (Xj): 

Optimize:  Z  *  C,*X,  +  C-*X_  . .+C  *X 

1122  n  n 


where  Cj  is  a  constant  (model  parameter)  representing  the  marginal  contri¬ 
bution  to  Z  resulting  from  a  unit  change  in  X j .  Expressed  more  efficiently 
in  summation  notation,  the  objective  function  can  be  written  as: 


n 

Optimize:  Z  =  ^  Cj  •  X j  j 


All  of  the  remaining  relationships  in  an  LP  decision  system  model  are 
considered  as  constraints  to  be  satisfied  in  optimizing  the  objective  func¬ 
tion.  System  constraints  typically  express  the  relationship  between  deci¬ 
sion  variables  in  the  system  and  various  environmental  factors  (parameters) 
representing  resource  limitations  and/or  system  output  requirements.  The 
constraints  in  an  LP  model  can  be  either  equalities  or  inequalities.  The 
set  of  constraints  can  be  expressed  mathematically  as: 


AllXl  +  A12X2+“-+AlnXn  ='  bl 


A21  X1  +  A22  X2  +  "*  +  A-  X-  b^ 


2n  n 


or,  alternatively: 
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J  a, j  X.  (-,*,-)  b-  (i  =  1,2,... ,m) 
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where  a^j  is  a  constant  (model  parameter)  representing  the  marginal  con¬ 
sumption  of  resource  b^  or,  alternatively,  the  marginal  satisfaction  of 
requirement  b^  resulting  from  a  unit  increase  in  decision  variable  X^ . 

While  system  constraints  generally  relate  decision  variables  and 
environmental  constraints,  they  may  also  specify  boundary  values  on  the 
domains  of  the  respective  decision  variables.  In  all  LP  models,  for 
example,  decision  variables  cannot  be  negative  (you  can't  allocate  negative 
quantities  of  a  resource).  Therefore,  LP  models  must  include  a  set  of 
"nonnegativity"  constraints  of  the  form: 


Xj  *0  (  j  =  1,2, ..  .,n) 


The  General  LP  Model 


To  summarize,  every  LP  decision  system  model  is  a  variation  of  the 
following  general  formulation: 


Optimize : 


Z  -  CX  xx  +  c2  X2  +...+  Cn  Xn 
subject  to  (s.t.): 

An  X1  +  A12  x2  +***+  Aln  xn  { -'  "»  bl 

A21  X1  +  a22  x2  +  ,“  +  A2n  xn  ("' 

Aml  X1  +  Am2  x2  +  ,,,+  Ajrtn  xn  ( 1' 

Xj  -  0  (j  a  1,2,. ..,n) 

or,  alternatively,  in  summation  notation: 

n 

Optimize:  Z  *  \  C .  X . 
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Xj  >  O  ( j-1,2,...,n) 

Formulation  Examples 
EXAMPLE  16-1 

Problem  Statement 
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You  are  responsible  for  inspecting  the  work  of  two  contractors  currently 
working  on  the  base.  From  past  experience  and  available  historical  data, 
you  estimate  that  each  hour  spent  inspecting  Contractor  No.  1  will  result 
in  a  savings  to  the  government  of  $75.  Similarly,  each  hour  spent 
inspecting  Contractor  No.  2  results  in  an  estimated  savings  of  $50.  You 
estimate  that  a  total  of  120  manhours  of  inspection  time  are  available  to 
allocate  between  the  two  contracts  each  week.  Existing  policy  requires 
that  you  devote  at  least  30%  of  the  available  inspection  time  to  each 
contract.  In  addition,  your  transportation  budget  permits  your  inspectors 
to  log  a  total  of  approximately  600  miles/week  in  support  of  their  work  on 
these  two  contracts.  You  estimate  that,  on  the  average,  about  4  miles  are 
driven  for  each  hour  spent  inspecting  Contractor  No.  1  and  6  miles  for  each 
hour  spent  inspecting  Contractor  No.  2.  How  should  you  allocate  the 
available  inspection  manhours  between  the  two  contracts? 

System  Analysis 

System  Components: 

1.  Criterion  Variable  (Z):  savings  in  dollars,  the  objective  is 
to  maximize  (Z>0). 


X^  ■  manhours  allocated  to  inspection  of  Contractor  No.  1  (X ^  2  36) 
X2  *  manhours  allocated  to  inspection  of  Contractor  No.  2  (X2  £36) 
3.  Environmental  Variables  (Xj) 

X ^  •  total  inspection  manhours  allocated  (X^  $120) 

X^  =  total  miles  driven  inspecting  Contractors  1  and  2  (X^  a  400) 
System  Relationships 

Figure  16-4 


LP  Model 

maximize:  2  *  75Xj.  +  50X2 

s.t.  Xj  +  X2  ^  120  (X3) 

X1  >  36 

X2  >  36 

5Xx  +  8X2  <  600  (X4) 

(Note:  the  nonnegativity  constraints,  Xj  >  0,  required  in  all  LP  models 
are  insured  by  the  policy  conditions  that  require  at  least  30%  of  the 
available  inspection  time  to  be  allocated  to  each  contract,  i.e.,  at  least 
120  x  0.3  *  36  manhours). 


EXAMPLE  16-2 


Problem  Statement 

You  are  estimating  the  cost  of  1f000  tons  of  concrete  to  be  used  in  a 
project  to  repair  the  airfield  pavement  at  your  base.  Sand  for  the  concrete 
is  available  from  two  sources.  Washed  beach  sand  can  be  obtained  at  a 
delivered  cost  of  $60/ton  and  contains  4  parts  fine  sand,  3  parts  coarse 
sand,  and  5  parts  gravel.  Clean  river  sand  can  be  obtained  at  a  delivered 
cost  of  $  100/ton  and  contains  3  parts  fine  sand,  6  parts  coarse  sand,  and 
9  parts  gravel.  Each  batch  of  concrete  must  contain  4  parts  gravel,  3 
parts  fine  sand,  and  3  parts  coarse  sand.  How  much  sand  should  be 
purchased  from  each  source? 

System  Analysis 


System  Components: 


1.  Criterion  Variable  (Z):  Cost  in  dollars;  the  objective  is  to 
minimize  Z.  (Z  >  0). 

2 .  Decision  Variables  ( X j ) : 

X]_  »  tons  of  beach  sand  to  purchase  (X^  *  0 ) 

X2  ■  tons  of  river  sand  to  purchase  (X2  *0) 

3.  Environmental  Variables : 

X3  ■  tons  of  fine  sand  used  (X3  *  3/10  x  1000  ■  300  tons) 

X4  *  tons  of  coarse  sand  used  {X4  “  3/10  x  1000  ■  300  tons) 

X5  »  tons  of  gravel  used  (X5  -  4/10  x  1000  ■  400  tons) 


System  Relationships: 


X1  -  0 


beach 


Figure  16-5 
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Problem  Statement 


A  new  Federal  Environmental  Protection  Organization  is  being  formed, 
and  there  jure  2,000  professional  and  1,000  nonprofessional  positions  which 
need  to  be  filled.  Recruiting  costs  average  $1,000  for  each  professional 
position  and  $400  for  each  nonprofessional  position.  Typically,  these 
costs  are  20%  higher  than  average  for  recruiting  women  and  30%  higher  than 
average  for  recruiting  minorities  (men  and  women).  HEW  has  examined  state 
employment  records  and  has  mandated  that  women  should  constitute  at  least 
40%  of  new  hirings  within  the  agency  and  minorities  should  constitute  at 
least  50%  of  new  hirings.  How  should  the  required  positions  be  filled? 


System  Analysis 

System  Components: 

1.  Criterion  Variable  (Z):  Cost  in  dollars;  the  objective  is  to 
minimize  Z  ( z  >  0 ) . 

2.  Decision  Variables  (X^j^);  Let  X^j^  ■  the  number  of 
individuals  of  type  (i,  j,  k)  hired  where: 


i  -  1 

professional 

-  2 

nonprofessional 

j  -  1 

men 

-  2 

women 

k  -  1 

minority 

-  2 

nonminority 

3.  Environmental  Variables  (Xj): 

Xx  ■  total  no.  of  professional 3  hired  (X^  ”  2000) 

X2  -  total  no.  of  nonprofessionals  hired  (X2  *  1000) 
X2  ■  total  no.  of  women  hired  (X2  >  1200) 

X^  »  total  no.  of  minorities  hired  (X4  >  1500) 


INSTALLATION 

VENDOR 

1 

2 

3 

4 

CAPACITY 

1 

$5500 

$6500 

$7000 

8 

2 

$6500 

■ 

10 

3 

$6000 

$8000 

$9000 

1 

12 

DEMAND 

10 

5 

8 

7 

30 

How  much  fuel  should  be  purchased  from  each  vendor  for  delivery  to  the 
respective  bases? 


System  Analysis 

System  Components: 

1.  Criterion  Variable  (Z):  Cost  in  dollars;  the  objective  is  to 
minimize  Z  (Z  >0). 

2.  Decision  Variables:  Let  X^j  *  the  number  of  tank  cars  of  fuel 
shipped  from  vendor  i  to  installation  j,  where  i  *  1,2,3  and  j  *  1,2, 3, 4. 

3 .  Environmental  Variables : 


X^  =  demand  at  installation  1  (X^  *  10) 
X2  ■  demand  at  installation  2  (X2  “  5) 
Xj  *  demand  at  installation  3  (X^  *  8) 
X4  *  demand  at  installation  4  (X4  -  7) 


Xg  *  supply  capacity  at  vendor  1  (X5  <  8) 

Xg  »  supply  capacity  at  vendor  2  (Xg  <  10) 

X7  *  supply  capacity  at  vendor  3  (X7  <  12) 


System  Relationships 


Figure  16-7 
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<ijk*  0  <i  -  1,2,3;  j  -  1,2, 3, 4) 


SOLUTION  TECHNIQUES 

One  of  the  primary  factors  leading  to  the  popularity  of  LP  as  an  aid 
to  decision  taking  is  that  solution  algorithms  are  readily  available.  For 
very  simple  systems  involving  only  two  or  three  decision  variables,  the 
optimal  policy  or  solution  cam  be  determined  graphically.  For  more  complex 
problems,  a  solution  procedure-the  simplex  method-offers  a  relatively  effi¬ 
cient  technique  for  determining  the  optimal  policy.  While  the  graphical 
solution  is  not  often  used  in  practice  because  of  its  restricted  applicabil¬ 
ity,  it  provides  a  good  basis  for  understanding  the  manner  in  which  more 
complex  problems  can  be  solved. 
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The  Graphical  Method 

Consider  again  the  LP  model  developed  in  Example  16-1: 
maximize  Z  -  75X  ^  +  SOX  2 

s.t.  X1  +  x2  <  120 

x  1  >36 

X2  >  36 

5Xx  +  8X2  <  400 

The  first  step  in  the  graphical  solution  procedure  is  to  construct  an 
orthogonal  coordinate  system  with  each  of  the  axes  representing  one  of  the 
decision  (resource)  variables: 


System  Relationships 


Figure  16-8 
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Note  that,  since  all  LP 
models  require  Xj  >  0,  only 
one  quadrant  need  be  constructed 


Next,  each  of  the  constraints  is  constructed  in  this  coordinate  system: 


Constructing  the  first  constraint,  +  X2  <  120,  defines  a  triangular 
region  bounded  by  that  constraint  and  each  of  the  axes.  In  a  similar 
fashion,  the  other  constraints  can  be  constructed  in  the  coordinate 
system: 
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The  shaded  area  represents  that  region  of  points  or  coordinates  which 
simultaneously  satisfy  all  of  the  constraints.  Because  only  the  points  in 
the  shaded  area  represent  feasible  solutions  to  the  problem,  this  area  is 
referred  to  as  the  feasible  region  or  (feasible)  solution  space.  Note, 
too,  that  for  this  particular  problem,  the  first  constraint  constructed, 
i.e.,  X ^  +  X2  _  120,  does  not  bound  the  feasible  region  at  any  point  -  it 
is  "redundant"  and  is  not,  in  this  case,  really  a  limitation.  But  which  of 
the  infinite  number  of  points  in  the  feasible  solution  space  (Xj,  X  2) 
yields  the  greatest  value  of  Z  (remember,  the  objective  is  to  maximize  Z)? 
The  answer  is  provided  by  constructing  a  line  on  the  graph  representing  the 
objective  function.  This  can  be  done  by  arbitrarily  selecting  a  value  for 
Z,  e.g.,  Z  *  5000  ■  75X  ^  +  50X2,  drawing  that  line  on  the  graph  as 

shown  in  Figure  16-11.  Any  value  of  Z  selected  will  yield  an  "iso-profit" 
line  parallel  to  the  first.  Since  the  objective  here  is  to  maximize  Z,  the 
problem  is  to  find  the  iso-profit  line  corresponding  to  the  largest  value 
of  Z,  while  still  including  a  point  in  the  feasible  region.  By  inspection, 
it  is  apparent  that  as  iso-profit  lines  are  constructed  in  the  direction 
away  from  the  origin,  the  value  of  Z  increases,  and  vice  versa. 

Figure  16-11 


INCREASING  VALUES  OF  Z 


Inspection  of  the  graph  also  shows  that  the  iso-profit  line  intersecting 
point  B  will  result  in  the  greatest  value  of  Z.  Point  B  represents  the 
intersection  of  the  constraint  lines: 


600 


X  2  -  36 

5X  1  +  8X2  - 

solving:  5X^  +  8(36)  *  600 

XI  -  312/5  *  62.4 


Therefore,  the  optimal  feasible  solution  to  this  problem  is  X^  *  62.4, 

X2  -  36,  and  z  -  75(62.4)  +  50(36)  -  4fi80  +  1800  -  $6480.  The  fact 
that  the  optimal-solution  fell  at  a  corner  point  of  the  feasible 
region  is,  of  course,  not  merely  a  coincidence.  It  is  characteristic  of 
all  LP  models.  A  set  of  constraints  which  are  expressed  as  linear  inequali¬ 
ties  must  necessarily  form  a  convex  set.  Further,  given  a  linear  objective 
function,  regardless  of  its  slope  of  the  direction  of  optimization,  the 
optimal  solution  will  always  include  a  corner  (extreme)  point  of  a  linear 
convex  set.  This  characteristic  holds  whether  we  are  dealing  with  two, 
three,  or  n  decision  variables/dimensions.  This  characteristic  is  the  key 
to  the  systematic  solution  of  all  LP  models  because  it  means  one  need 
investigate  only  a  finite  number  of  corner  points  to  be  assured  of  finding 
an  optimal  feasible  solution  if  one  exists.  Since  corner  points  are,  in 
effect,  defined  by  the  intersection  or  two  or  more  constraints,  the  syste¬ 
matic  determination  of  an  optimal  solution  is  mathematically  equivalent  to 
the  iterative  solution  of  sets  of  simultaneous  linear  equations  defining 
the  respective  corner  points  of  the  feasible  region. 


Aberrations  in  LP  Models 


In  building  and  solving  LP  models  of  various  decision  systems,  it  is 
possible  to  encounter  a  number  of  aberrations  to  the  general  LP  model  and 
•its  solution.  Three  such  contingencies  are  commonly  encountered:  (1) 
alternative  optimal  solutions,  (2)  unbounded  solutions,  and  (3)  nonexisting 
feasible  solutions* 


Alternate  Optimal  Solutions 

Consider  the  following  LP  decision  system  model : 

maximize:  Z  *  X„  +  2X„ 

1  2 

s.t. :  2X1  +  4X2  <  9 

3XX  +  X2  <  12 

Xj_,  Xj  >  0 
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A  graph  of  this  model  is  illustrated  in  Figure  16-12: 


Figure  16-12 

Pi 
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Note  that  the  objective  function  for  Z  ==  3  is  parallel  to  the  constraint 
2X^  +  4X2  <  9*  It  is  apparent  that  when  the  objective  function  line  is 
(graphically)  moved  away  from  the  origin  in  an  attempt  to  maximize  Z, 
points  C  and  D  will  be  encountered  simultaneously,  as  will  the  infinite 
number  of  points  defining  the  line  connecting  these  points.  In  fact,  the 
optimal  solution  occurs  when  the  objective  function  is  coincident  with  line 
2X^  +  4X2  =*  9.  Any  point  on  the  line  segment  connecting  points  C  and  D 
will  yield  the  same  optimal  value  of  Z  (in  this  case,  Z  *  4.5).  Whenever 
the  objective  function  is  parallel  to  a  binding  constraint,  an  infinite 
number  of  alternate  optima  exist. 


Unbounded  Solutions 


Consider  the  following  LP  decision  system  model: 

optimize:  Z  ■  2X^  +  X^ 


s 


< 


ts 


i 


1 


Note  that  the  objective  function  line  with  Z=8  has  been  included  as  a 
reference.  If  the  optimization  objective  for  this  problem  was  to  minimize 
Z,  it  is  apparent  that  the  optimal  feasible  solution  will  be  at  point  B, 
the  extreme  (corner)  point  of  the  solution  space  that  is  closest  to  the 
origin  (X^  ~  2,  X2  =  3,  Z  =  7).  Suppose,  however,  that  the  stated  objective 
was  to  maximize  Z.  Since  the  feasible  region  is  unbounded  (in  this  case) 
in  the  direction  away  from  the  origin,  there  is  no  limit  to  the  distance 
away  from  the  origin  at  which  objective  function  lines  can  be  constructed. 
Mathematically,  Z  can  be  increased  without  bound.  When  this  situation  is 
encountered,  it  indicates  that  the  decision  system  model  has  been  for¬ 
mulated  incorrectly.  Go  back  and  check  your  analysis  and  logic. 

No  Feasible  Solution 


In  many  instances,  a  decision  situation  may  be  described  for  which  no 
feasible  solution  exists,  i.e.,  it  is  not  possible  to  simultaneously 
satisfy  all  of  the  conditions  or  constraints  on  the  problem.  This  is  par¬ 
ticularly  true  for  systems  having  large  numbers  of  constraints  -  iecis; 
systems  that  are  "tightly  bound".  Consider  the  following  example: 


optimize:  Z  *  2X  ^  +  X2 

s.t.  X  L  +  X  2  i  4 
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UNCLASSIFIED 


Figure  16-14  illustrates  the  solution  space  for  this  model: 


Figure  16-14 


It  is  apparent  that  there  is  no  point  that  can,  at  the  same  time,  meet  all 
three  conditions*  In  this  case,  there  is  no  solution  to  the  decision 
situation  as  it  has  been  described  in  the  model* 

THE  SIMPLEX  METHOD 

While  the  graphical  solution  procedure  works  well  for  very  simple  problems 
having  only  two  decision  variables,  there  are  few  realistic  problems  that 
can  be  adequately  modeled  in  two  dimensions.  For  problems  involving  hundreds 
of  variables  and  constraints,  a  more  powerful  and  efficient  technique  is 
required.  This  technique  is  the  simplex  method.  The  simplex  method  is  a 
straightforward  procedure  that  begins  with  an  initial  feasible  solution  (i.e., 
a  corner  point  of  the  feasible  region)  and  sequentially  proceeds  toward  the 
optimal  solution  by  examining  adjacent  points  in  the  solution  space.  The 
procedure  iterates  or  moves  toward  the  optimal  solution  in  such  a  way  that 
each  new  solution  examined  is  better  than  the  previous  one.  When  a  solu¬ 
tion  is  examined  and  found  to  be  less  optimal  them  the  previously  investi¬ 
gated  corner  point,  the  procedure  terminates.  The  simplex  algorithm 
procedes  through  the  following  basic  steps: 
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(1)  Determine  an  initial  basic  feasible  solution; 

(2)  Check  the  solution  for  optimality: 

a*  if  optimal/  stop; 

b.  if  not  optimal,  continue; 

(3)  Locate  the  adjacent  corner  point  of  the  solution  space  that 
makes  the  greatest  contribution  to  optimizing  Z; 

(4)  Go  to  step  (2). 

Mechanically  or,  more  appropriately,  mathematically,  the  simplex  proceeds 
by  solving  the  subset  of  simultaneous  linear  equations  which  define  a  par¬ 
ticular  corner  point  or  solution.  If  the  solution  obtained  is  not  optimal, 
a  new  subset  of  constraint  equations,  corresponding  to  one  of  the  adjacent 
extreme  points,  is  systematically  selected  and  solved.  The  iterative  pro¬ 
cess  continues  until  the  optimal  solution  is  determined. 

Standard  (Canonical)  Form  of  the  LP  Model 

The  first  step  in  the  simplex  solution  procedure  is  to  convert  the  LP 
model  resulting  from  your  analysis  of  the  decision  system  of  interest  to  a 
standard  or  canonical  form  having  the  following  characteristics:  (1)  all 
structural  (nonnegativity)  constraints  are  expressed  as  equalities;  (2)  the 
right-hand  side  constants  in  the  constraint  set  are  all  nonnegative;  (3)  all 
variables  are  nonnegative;  (4)  the  objective  is  maximization.  Converting 
the  LP  model  to  a  standard  form  greatly  facilitates  the  solution  procedure, 
particularly  when  being  accomplished  manually,  i.e.,  without  one  of  the 
many  available  LP  computer  codes.  In  summation  notation,  the  standard  form 
of  the  LP  model  is: 

n 

maximize:  Z  ■  J  c4  x4 
)-i  3  3 
n 

S*t*  ^  *ij  *3  "  bi  (t  *  . . . 

Xj  >  0  ■  1,2, . ,n) 

Conversion  of  an  LP  Model  to  Standard  Form 

In  general,  the  LP  model  resulting  from  analysis  of  the  decision  system 
will  have  mixed  constraints,  i.e.,  equalities  and  inequalities  ( <,  >,  or  both). 

Conversion  of  <  Constraints. 

For  each  <  constraint  i,  introduce  a  slack  variable  (Si)  to  the  left  side 
of  the  expression  and  convert  the  inequality  to  an  equality.  For  example, 
given: 

maximize:  Z  -  SX^  +  8X2 

s.t.  3Xl  +  4X2  i  12 

2X1  +  3X2  S  6 
Xj,  X2  2  0 
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converting  to  standard  form,  this  becomes: 
maximize:  Z  -  5Xx  +  8X2 

s.t.  3Xx  +  4X2  +  Sx  -  12 

2Xx  +  3X2  +  S2  =  6 

Xx»  X2,  Sx»  S2  SO 

Since  the  left  side  of  the  constraint  must  be  <  the  right  side  constant 
(bj.) ,  the  variable  represents  the  "slack"  between  the  left  and  right  sides 
of  the  equality.  In  physical  terms,  if  bx  represents  the  total  amount  availa¬ 
ble  of  a  particular  resource,  S£  represents  the  unused  portion  of  that  total. 

Of  course,  when  all  of  the  available  resource  i  (b^)  is  used  and  the  constraint 
is  "binding",  S£«0.  Note  that  slack  variables  do  not  appear  in  the  objective 
function  or,  alternatively,  they  appear  in  the  objective  function  with  a 
coefficient  (cj)  equal  to  zero. 

Conversion  of  >  Constraints 

For  each  >  inequality  constraint,  the  left  side  must  be  at  least  equal  to 
the  right  side  value  b^.  It  can  be  greater,  but  it  cannot  be  less.  The  i 
inequality  is  converted  to  an  equality  by  subtracting  a  surplus  variable  (E^) 
from  the  left  side.  In  addition,  it  is  also  necessary  to  introduce  another 
variable,  called  an  artificial  variable  (Ax).  The  artificial  variable  is 
necessary  to  provide  a  starting  point  in  the  solution  process,  i.e. ,  an  ini¬ 
tial  basic  feasible  solution.  The  rationale  for  introducing  the  artificial 
variable  will  be  further  discussed  shortly.  As  an  example: 

maximize:  Z  -  5Xj  +  8X2 


s.t.  3XX  +  4X2  >  12 

2XX  +  4X2  £  6 

xx  +  x2  £  0 

converting  to  standard  form: 

maximize:  Z  »  5X^  +  8X2 

s.t.  3Xx  +  4X2  -  *1  ♦  *1  -12 
2Xx  +  3X2  -  E2  ♦  *2  -  « 
xl»  x2'  El'  E2»  *1»  »2  i  0 


Unfortunately,  there  is  no  guarantee  that  one  or  more  of  the  artificial 
variables  won’t  show  up  in  the  final  solution  (at  a  nonzero  level) .  A  number 
of  "tricks"  have  been  devised  to  preclude  artificial  variables  in  the  final 
solution.  The  most  popular  of  these  is  the  method  of  penalties  -  more  com¬ 
monly  referred  to  as  the  "Big  M”  method.  Since  the  objective  is  to  maximize, 
introducing  A^  in  the  objective  function  with  a  very  large  -Cj  coefficient 
(represented  by  "M")  insures  that  if  any  A^  shows  up  in  solution,  a  very  large 
penalty  in  terms  of  the  objective  is  incurred.  Therefore,  the  standard  form  of 
the  example  is: 

maximize:  Z  “  5X^  +  8X2  -  MA^  -  MA2 

S.t.  3XX  +  4X2  “  Ex  +  Ax  -12 

2XX  +  3X2  -  E2  +  A2  ■  6 

xx ,  x2 ,  ,  E2 ,  A^ ,  ®2  k  0 
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Equality  Constraints 

Many  LP  models  have  equality  constraints  as  well  as  those  inequalities 
just  discussed.  On  the  surface,  it  would  seem  that  no  action  is  required 
here  to  convert  it  to  standard  form.  Unfortunately,  such  is  not  the  case. 
For  purposes  of  finding  a  starting  point  (as  in  the  case  of  the  >  inequality) 
it  is  necessary  to  introduce  an  artificial  variable  (A^)  in  each  equality 
constraint  and  a  term  in  the  objective  function.  To  summarize,  then, 

consider  the  following  LP  model: 

maximize:  Z  *  2X^  +  5X2 

s.t.  Xx  +  3X2  <  30 

4XX  +  2X2  £  15 

X1  +  x2  *  10 

xlf  x2  >  0 


converting  to  standard  form,  this  becomes: 

maximize: 
s  .t. 


*1  ,  X2  ,  *2  ,  *2  ,  *3  '  « 

Conversion  of  Negative  Right  Side  Values 

Recall  that  in  its  standard  form,  negative  b^  values  are  not  permitted. 
Should  a  -bjL  value  result  from  the  decision  system  analysis,  simple  multiply 
both  sides  of  the  constraint  by  -1.  If  the  constraint  is  an  inequality, 
multiplying  through  by  -1  reverses  the  sense  or  direction  of  the  inequality. 
Once  the  -b^  value  has  been  converted  to  a  +b^,  we  proceed  as  before,  intro¬ 
ducing,  as  necessary,  slack,  surplus,  and/or  artificial  variables.  Consider, 
for  example,  the  constraint: 

Xj^  +  X2  A  -5 

to  convert  to  standard  form,  multiply  both  sides  by  -1: 

-X1  -  x2  <  5 

and  add  a  slack  variable  to  convert  the  inequality  to  an  equality: 

-XX  -  X2  +  sx  -  5 

Conversion  of  Minimization  Objectives 

While  it  is  not  absolutely  necessary  to  convert  minimization  problems  to 
maximize  problems ,  it  is  convenient  to  do  so  because  only  one  set  of  proce¬ 
dural  rules  need  be  mastered.  Here,  too,  the  conversion  procedure  is 


z  =  2XL  +  5X2  -  MA2 


-  MA, 


*1 

4Xi 


3X2 

2X2 

*2 


-  *2 


*2 

*3 


30 

15 

10 


straightforward:  simply  multiply  by  -1.  In  other  words: 


n 


n 


minimizing  Z  ■  J  C ,X , 
j-1  3  3 


is  equivalent  to  maximizing  (-E) 


l 

j-1 


(-c^x 


j 


The  Solution  Procedure 


Converting  the  LP  model  to  standard  form  effectively  results  in  a 
system  of  simultaneous  linear  equations*  In  general,  in  any  such  system 
having  m  equations  and  n  unknowns,  there  can  be  a  unique  solution,  an  infi¬ 
nite  number  of  solutions,  or  no  solution,  depending  on  the  relative  size  of 
m  and  n.  When  n  >  m,  any  m  variables  can  be  expressed  in  terms  of  the 
remaining  (n-m)  variables.  Consequently,  there  are,  in  this  case,  an  infi¬ 
nite  number  of  solutions.  This  is  essentially  the  situation  when  an  LP 
model  has  been  converted  to  standard  form.  If  the  original  model  had  n 
(decision)  variables  and  m  constraints,  when  converted  to  standard  form,  at 
least  one  slack  (Si),  surplus  (Ei),  and/or  artificial  variable  is  intro¬ 
duced  in  each  of  the  m  constraints.  Consequently,  the  standard  form  trill 
have  at  least  (m+n)  variables  and  m  constraint  equations.  However,  a  unique 
solution  can  be  obtained  by  setting  any  n  unknown  variables  to  zero  and 
solving  for  the  remaining  m  unknown  variables.  At  the  maximum,  i.e., 
through  exhaustive  explicit  enumeration,  the  total  number  of  solutions  to 
be  examined  corresponds  to  the  total  number  of  ways  of  selecting  m 
variables  from  (m+n)  variables.  This  is  given  by: 


(n+m)  (n+m)  1  -  (n+m)! 

tb  “  "(n+m-m)  imi  nlml 


Consider  again  Example  16-4  in  which  the  resulting  LP  model  had  12  variables 
and  7  constraints: 


(12+7)  19! 

12  "  12!  7! 


50,388 


Fortunately,  of  this  total,  we  need  only  look  at  the  feasible  subset  in 
which  Xj  i.  0.  Since,  as  we  have  seen,  the  optimal  solution  will  be  at  one 
of  the  corners  or  extreme  points  of  this  feasible  region,  the  problem  is 
actually  much  more  manageable  than  it  might  seem.  Because  the  simplex  pro¬ 
cedure  is  predicated  on  these  structural  characteristics  (and  because  effi¬ 
cient  simplex- baaed  computer  codes  have  been  developed) ,  it  can  rather 
efficiently  solve  problems  having  hundreds  of  variables  and  constraints. 
Before  briefly  describing  the  simplex  algorithm,  it  will  be  convenient  to 
introduce  a  few  basic  terms: 


Basic  Solution:  Any  point  defined  by  the  intersection  of  two  (or 
more)  constraint  equalities.  Basic  solutions  can  be  feasible  or 
infeasible.  For  example,  in  Figure  16-15,  points  A,  B,  D,  F,  and  G 
are  Basic  Feasible  Solutions.  Points  C,  S,  and  H  are  Basic  Infeasible 
Solutions . 


Solution  Basis:  The  m  variables  selected  for  a  solution  are  said  to 
constitute  a  (solution)  basis. 
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These  m  variables  are  referred  to  as  basic  variables  while  the 
remaining  n  variables,  arbitrarily  set  equal  to  zero,  are  referred  to 
as  nonbaslc  variables. 

Degeneracy;  In  a  basic  feasible  solution,  all  of  the  m  basic 
variables  are  >  0.  The  n  nonbasic  variables  are,  of  course,  set  equal 
to  zero.  However,  in  some  cases,  one  or  more  of  the  basic  variables 
may  be  equal  to  zero.  When  this  occurs,  the  basic  feasible  solution 
is  said  to  be  degenerate. 

The  next  step  in  the  process  is  to  convert  the  standardized  LP  model  to 
a  table  or  tabular  form  in  order  to  facilitate  the  mechanics  of  the  simplex 
procedure.  To  illustrate,  consider  the  following  simple  example: 


minimize:  Z 
s.t. 


3*1  +  *2 
*1  +  *2 


<  5 

*2  ^  1 
Zj  >  0 


(1) 

(2) 

(3) 


converting  to  standard  form: 

maximize:  (-Z)  ■  -3X^  -  Xj  -  + 

s.t.  *1  +  %  +  *1  "  3 

*1  +  %  »  5 

*2  “  ®3  +  *3  "  1 
*1  •  *2'  *1'  %  '  '  *3  -  0 


+  OE3  -  MA3 


this  aodsl  can  be  expressed  as  the  following  linear  system: 


-1Z  + 

3XX  4 

*2 

4- 

MA1 

— 

os2 

— 

0B3 

4’ 

**3 

0Z  + 

1X1  + 

1X2 

4- 

1A1 

4> 

os2 

4- 

0E3 

4 

0A3 

oz  + 

1XX  + 

ox2 

+ 

0Ai 

4- 

1S2 

4- 

0E3 

+ 

0A3 

oz  + 

0XX  4- 

ix2 

4- 

°Ai 

+ 

os2 

- 

1B3 

4- 

1A3 

This  same  system  can  be  somewhat  more  efficiently  described  using  the 
following  table: 

Table  16-1 


ROW  BASIS 
(0)  Z 

(1) 

(2)  $2 

(3) 


Z 

-1 

0 

0 

0 


3 

1 

1 

0 


1 

1 

0 

1 


M 

M 

1 

0 

0 


s2 

0 

0 

1 

0 


®3 

0 

0 

0 

-1 


M 

M 

0 

0 

1 


0 

3 

5 

1 


Having  constructed  this  initial  table,  the  procedure  continues  as  follows: 

( 1 )  Select  a  convenient  starting  basis  of  m  variables  which  is 
feasible  (find  an  initial  basic  feasible  solution). 


The  basic  variables  are  those  which  form  an  identity  matrix,  i.e.,  those 
variables,  in  addition  to  Z  (which  is  always  in  the  basis),  for  which  the 
column  vector  contains  a  single  "4-1"  entry  and  0  in  the  remaining  positions. 
At  this  point,  only  Sfc  satisfies  the  identity  condition.  However,  through 
appropriate  algebraic  manipulation,  the  "M"  entries  can  be  cleared  out  of  the 
columns  corresponding  to  variables  A^  and  Aq ,  thereby  providing  an  initial 
basic  feasible  solution.  Specifically,  if  each  entry  in  row  ( 1 )  is  multi¬ 
plied  by  -H  and  added  to  the  respective  values  in  row  (0)  (i.e.,  the  elimi¬ 
nation-substitution  technique),  the  following  matrix  results: 

Table  16-2 

ROW  BASIS  Z  Xj  X2  Ki  Sj  S3  A3  b^  ri 
(0)  Z  -1  3-M  1-M  0  0  0  N  -3M 

( 1 )  Aj  0  1  1  1  0  0  0  3 

(2)  Sj,  0  1  0  0  1  0  0  5 

(3)  0  0  1  0  0  -1  1  1 

Similarly,  to  clear  the  zero  from  row  (0),  column  Ag,  we  multiply  row  (3) 
by  -M  and  add  it  to  row  ( 0 ) : 

Table  16-3 

ROW  BASIS  Z  X,  X*  A,  S*  B«  A:  b,  r. 


This  matrix  now  provides  an  initial  basic  feasible  solution  which  can 
be  read  directly  from  the  table:  Aj  ■  3,  S2  *  5,  A3  ■  1,  and  £  -  -4M.  The 
other  variables  are,  at  this  point,  nonbasic  and  set  equal  to  zero. 

Looking  back  at  Table  16-1,  it  becomes  somewhat  easier  to  see  why,  in 
converting  the  LP  model  to  standard  form,  it  is  necessary  to  include  an 
artificial  variable  for  each  equality  (■)  and  (>)  inequality  constraint. 

Had  Ai  and  A3  not  been  introduced  here,  we  would  not  have  been  able  to 
readily  determine  an  initial  basic  feasible  solution  -  we  would  not  have 
had  an  identity  matrix. 

(2)  Check  the  solution  for  optimality 

The  next  step  in  the  simplex  algorithm  is  to  determine  if  the  solution 
just  obtained  is  optimal.  This  can  be  done  by  inspecting  the  table.  The 
values  in  row  (0)  represent  the  marginal  contribution  to  the  objective  when 
a  unit  of  the  respective  nonbasic  variable  is  brought  into  the  solution 
basis.  Remember,  the  basic  variables  will  have  a  ”0"  in  this  position. 

When  all  of  the  values  in  row  (0)  are  >  0,  no  further  improvement  is 
possible  and  the  optimal  solution  has  been  found.  Recall  that  in  the  pro¬ 
cess  of  converting  the  standard  form  of  the  LP  model  to  the  tabular  form, 
the  signs  of  the  coefficients  in  the  objective  function  were  reversed. 
Consequently,  the  test  for  optimality  reflects  the  condition  of  all  objec¬ 
tive  function  coefficients  being  <  0  when  expressed  in  standard  form. 

Since  the  objective  is  to  maximize  Z,  introducing  any  units  of  a  variable 
for  which  the  coefficient  is  £  0  cannot  increase  Z  and  may  very  likely 
decrease  it.  To  reiterate,  the  solution  to  the  problem  is  optimal  when 
all  values  in  row  (0)  of  the  table  are  i.  0.  In  this  example ,  both  X^  and 
have  very  large  negative  coefficients.  Consequently,  this  initial  basic 
feasible  solution  is  not  optimal. 

(3)  Select  the  entering  variable 

To  proceed  toward  the  optimal  solution  as  efficiently  as  possible,  the 
simplex  procedure  moves  from  the  current  solution  point  to  the  adjacent 
extreme  point  in  the  feasible  region  that  results  in  the  greatest  contribu¬ 
tion  to  the  objective,  i.e.,  to  the  greatest  increase  in  Z.  Procedurally, 
this  is  accomplished  by  bringing  one  of  the  nonbasic  variables  into  the 
solution  basis.  In  doing  so,  one  of  the  basic  variables  is  driven  out. 

The  entering  variable  is  that  nonbasic  variable  having  the  most  negative 
coefficient  in  row  (0)  of  the  table.  Looking  at  Table  16-3,  has  a 
coefficient  of  (3-M)  while  Xg  has  a  coefficient  of  ( 1-2M) .  Therefore,  Xg 
becomes  the  entering  variable  (because  -2M  is  more  negative  than  -N). 

(4)  Select  the  leaving  variable 


As  X^  enters  the  solution  basis,  which  variable  is  driven  out  (i.e.,  A^, 
S g,  or  A3)?  Looking  at  the  Xg  (column)  vector  in  Table  16-3  for  each  unit  X 
is  increased,  A3  is  decreased  by  1,  Sg  is  decreased  by  0,  and  A.  is 
decreased  by  1.  The  relevant  question  then  becomes  how  many  units  the 
entering  variable  Xg  can  be  increased  before  either  A. ,  S- ,  or  A.  is 


exhausted  and,  therefore,  eliminated  from  the  solution  basis.  The  answer, 
of  course,  depends  on  the  amount  of  each  basic  variable  (bj.)  and  the  rela¬ 
tive  rate  at  which  it  is  consumed.  By  calculating  the  ratio  r j_  »  bi/ai j  * , 
where  aij*  is  the  positive  ( >0)  constraint  coefficient  for  the  entering 
variable  j  and  basic  variable  i,  the  relative  rate  at  which  the  various 
basic  variables  are  driven  out  of  the  basis  can  be  determined.  These 
ratios  appear  in  the  right  end  column  of  the  table.  The  smallest  of  these 
values  identifies  the  basic  variable  driven  to  zero  first  as  the  enter¬ 
ing  variable  is  increased.  In  this  example,  X2  can  be  increased  by  only  1 
before  A3  is  driven  to  zero.  Consequently,  A3  is  the  leaving  variable. 

(5)  Modify  the  table  to  reflect  the  new  solution 

To  establish  the  new  solution  basis,  we  algebraically  manipulate  the 
linear  system  so  that  the  entering  variable,  in  this  case  X2,  is  included  in 
the  identity  matrix.  Specifically,  the  table  is  manipulated  in  such  a  way  as 
to  result  in  a  "+1”  value  in  the  table  position  defined  by  the  entering  varia¬ 
ble  (X2)  and  the  leaving  variable  (A3).  This  is  the  so-called  "pivot  element". 
In  addition,  the  table  is  modified  by  necessary  algebraic  row  operations  to 
create  zero's  in  all  of  the  other  positions  in  the  column  corresponding  to  the 
entering  variable.  Referring  to  Table  16-3  we  need  to  manipulate  this  linear 
system  in  such  a  way  as  to  create  a  0  in  row  ( 1 )  and  row  ( 0 )  of  column  X  2 .  To 
create  the  0  in  row  ( 1 ) ,  we  multiply  the  "operator  row"  ( 3 ) ,  which  contains 
the  pivot  element,  by  -1  and  add  the  respective  values  to  those  of  row  ( 1 ) : 


Table 

16-4 

ROW 

BASIS 

Z 

Xl 

X  2 

*1 

S  2 

X  3 

A  3 

b 

r 

(0) 

Z 

-1 

3-M 

1-2M 

0 

0 

M 

0 

-1-4M 

(1) 

*1 

0 

1 

0 

1 

0 

1 

-1 

2 

(2) 

S2 

0 

1 

0 

0 

1 

0 

0 

5 

(3) 

xa 

0 

0 

1 

0 

0 

-1 

1 

1 

Similarly,  to 

create 

a  zero 

in  • 

cow  (0) 

Of  ' 

column  X-, 

the  operator  row 

C 

is  multiplied 

by  (-1+2M)  and 

added  to 

the  < 

corresponding  elements  of  row 

Table 

16-5 

ROW 

BASIS 

Z 

XI 

X2 

A 1 

S2 

Eg 

a3 

b 

r 

(0) 

Z 

-1 

3-M 

0 

0 

l 

D 

1-M 

-1+2M 

-2-2M 

(1) 

A1 

0 

1 

0 

1 

1 

9 

1 

-1 

2 

2 

(2) 

S2 

0 

1 

0 

0 

1 

0 

0 

5 

- 

(3) 

x2 

0 

0 

1 

0 

1 

9 

-1 

1 

1 

- 

One  iteration  of  the  simplex  procedure  has  been  completed.  We  now  return 
to  step  (2)  to  determine  if  this  new  solution  is  optimal  and  continue  the  pro¬ 
cess  until  the  optimal  solution  is  determined.  Inspecting  row  (0)  in  Table 
16-5  it  is  apparent  that  the  solution  is  not  optimal  because  not  all  of  the 
values  in  row  (0)  are  >  0.  Since  the  most  negative  coefficient  corresponds 
to  E  3,  it  enters  the  basis.  Since  there  is  only  one  positive  *15*  value  in 
the  E3  column,  corresponding  to  Ax ,  Ax  must  be  the  leaving  variable.  To  re¬ 
flect  this  new  basis,  the  system  is  manipulated  to  include  E3  in  the  identity 
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matrix .  To  creata  a  zero  in  row  (3),  column  E 3,  the  operator  row  (1)  is 
added  to  row  ( 3 ) : 


Table  16-6 


ROW 

BASIS 

Z 

*1 

X2 

Al 

s2 

e3 

A3 

b  r 

(0) 

Z 

-1 

3-M 

0 

0 

0 

1-M 

-1+2M 

-2-2M 

(1) 

e3 

0 

1 

0 

1 

0 

1 

-1 

2 

(2) 

S2 

0 

1 

0 

0 

1 

0 

0 

5 

(3) 

X2 

0 

1 

1 

1 

0 

0 

0 

3 

Then, 

to  create  a  zero 

in 

row 

(0), 

column  E3, 

multiply  the  elements  of  the 

operator  row 

(1)  by 

(- 

1+M) 

and 

add 

the  respective  to 

those  in 

row  ( 0 ) : 

Table 

16-7 

ROW 

BASIS 

Z 

*1 

*2 

Al 

S2  E3  A3 

b  r 

(0) 

Z 

-1 

2 

0 

-1+M 

0 

0  M 

0 

(1) 

*3 

0 

1 

0 

1 

0 

1  -1 

2 

(2) 

S2 

0 

1 

0 

0 

1 

0  0 

5 

(3) 

*2 

0 

1 

1 

1 

0 

0  0 

3 

Inspecting  row  (0),  all  coefficients  for  the  nonbasic  variables  are  >  0. 
This  solution  is  optimal.  The  optimal  policy  then  is  to  let  Xj  *  0  and 
X2  ■  3.  This  results  in  a  value  of  Z  “  0.  In  addition,  inspection  of  this 
final  table  indicates  several  other  important  pieces  of  information.  It 
suggests  that  under  this  optimal  policy,  Ej  ■  2  or  that  2  "extra"  units  of 
the  requirement  stipulated  by  constraint  (3)  will  be  produced.  In  addi¬ 
tion,  since,  under  the  optimal  policy#  S2  ■  5,  5  units  of  the  resource 
represented  by  constraint  (2)  will  remain  unused.  While  the  simplex  proce¬ 
dure  may  appear  to  be  rather  cumbersome  to  accomplish  manually,  it  is 
readily  performed  by  the  computer ,  and  many  very  efficient  LP  computer 
codes  are  available. 

Abberatlons  in  the  Simplex  Algorithm 

In  earlier  discussion,  a  number  of  abberations  were  introduced,  e.g. , 
alternative  optimal  solutions,  unbounded  solutions,  and  no  feasible  solu¬ 
tion.  How  are  these  conditions  identified  in  the  simplex  procedure? 

Alternative  Optimal  Solutions 

Recall  that  in  the  two  variable  situation,  alternative  optima  exist  when 
the  objective  function  is  parallel  to  a  binding  constraint.  Consider  the 
exastple: 

maximise:  Z  -  X  j  +  3X  2 

s.t.  2X2  +  6X2  £  12 

3Xi  +  2X2  S  6 
X1#  X22  2  0 


Converting  to  standard  form: 

maximize:  Z  *  X j  +  3X2  +  OSj  +  0S2 


s.t.  2X !  +  6X2  +  Si  =12 

3Xj  +  2X2  +  S2  =6 

Xi,  X2,  Sp  s2  >  0 


In  tableau  form, 

ROW 

this 

BASIS 

becomes 

Z 

f 

*1 

Table  16-8 

x2  sx 

s2 

bi 

ri 

(0) 

Z 

1 

-1 

-3  0 

0 

0 

(1) 

Si 

0 

2 

6  1 

0 

12 

2 

(2) 

S2 

0 

3 

2  0 

1 

6 

3 

The  initial  basic  feasible  solution  is  not  optimal.  X2  is  the  entering 
variable.  As  X2  enters,  Sj  is  driven  out  of  solution  first  ( r  p  =  2)  so  it 
( S i )  is  the  leaving  variable.  Adjusting  the  table,  we  need  to  create  a  "1" 
in  the  pivot  element  position  of  row  (1),  column  X2,  and  a  "0"  in  the  other 
two  positions  of  this  column.  The  new  solution  is: 

Table  16-9 

ROW  BASIS  Z  Xj  x2  Sx  S£  bp  rp 
(0)  Z  1  0  01/20  6 

(1)  X2  0  1/3  1  1/6  0  2 

(2)  S2  0  7/3  0  -2/6  1  2 

Since  all  the  coefficients  in  row  (0)  are  >  0,  this  solution  is  optimal 
Mow,  note  that  nonbaslc  variable  Xi  has  a  coefficient  of  0  in  row  (0)  of 
the  optimal  solution.  Bringing  Xj  into  the  solution  has  no  effect  on  Z, 
implying  that  alternative  optimal  solutions  exist. 

Unbounded  Solutions 

Consider  again  the  following  example: 

maximize:  Z  =  2Xj  +  Xi 

s.t.  Xj  +  X2  >  5 

XX  >2 

X2  >  1 

Converting  to  standard . form: 


maximize:  Z  ■  2Xj  +  X2  +  OEj  -  MAj  +  OE2  -  MA2  +  0E3  -  MA3 

s.t.  Xp  +  X2  -  Ej  +  Aj  =5  (1) 

Xj  -  E2  +  A2  =2  (2) 

X2  -  E3  +  A3  -  1  (3) 

X.,  V  Ai  -  0  <3  “  i  =  1,2,3) 


In  table  form 


Table  16-10 


ROW  BASIS 

Z 

X1 

x2 

Ei 

A1 

e2 

A2 

E3 

A3 

bi 

ri 

(0)  Z 

1 

-2 

-1 

0 

M 

0 

M 

0 

M 

0 

(D 

0 

1 

1 

-1 

1 

0 

0 

0 

0 

5 

(2) 

0 

1 

0 

0 

0 

-1 

1 

0 

0 

2 

(3) 

0 

0 

1 

0 

0 

0 

0 

-1 

1 

1 

To  obtain 
columns  Ai 

an  initial  basic  feasible  solution, 

,  A2 ,  and  A 

Table  16-11 

create 

a  zero 

in  row 

(0) 

for 

ROW  BASIS 

Z 

X1 

x2 

Ei 

A1 

E2 

A2 

E3 

A3 

bi 

ri 

(0) 

Z 

1 

-2-2M 

-1-2M 

M 

0 

H 

0 

H 

0  -8M 

(D 

A1 

0 

1 

1 

-1 

1 

0 

0 

0 

0 

5 

5 

(2) 

a2 

0 

1 

0 

0 

0 

1 

1 

0 

0 

2 

2 

(3) 

A3 

0 

0 

1 

0 

0 

0 

0 

-1 

1 

1 

The 

initial 

basic 

feasible  solution  is 

not 

optimal.  Xj 

enters 

and 

A2 

leaves .  The 

new 

solution  is: 

Table 

16- 

12 

ROW 

BASIS 

Z 

Xi  x2 

Ei  A 

ll 

E2  A2 

E3 

A3 

bi 

ri 

(0) 

Z 

1 

0  -1-2M 

M 

0 

-2-M  -2+2M 

M 

0 

4-4M 

(D 

A1 

0 

0  1 

-1 

1 

1  -1 

0 

0 

3 

3 

(2) 

X1 

0 

1  0 

0 

0 

-1  1 

0 

0 

2 

- 

(3) 

a3 

0 

0  1 

0 

0 

0  0 

-1 

1 

1 

1 

This 

solution  is 

not 

optimal . 

*2 

enters 

and  A3 

leaves 

.  The 

new  solution  is: 

ROW 

BASIS 

z 

Xi 

*2 

Ej 

Table  16 
A1 

-13 

% 

*2 

E3 

X 

bi  ri 

(0) 

Z 

1 

0 

0 

M 

0 

-2-M 

2+2M 

-1-2M 

5-2M 

(1) 

A1 

0 

0 

0 

-1 

1 

1 

-1 

1 

-1 

2  2 

(2) 

X1 

0 

1 

0 

0 

0 

-1 

1 

0 

0 

2 

(3) 

x2 

0 

0 

1 

0 

0 

0 

0 

-1 

1 

1 

The 

solution 

is 

not  yet 

optimal • 

E3  enters  and 

Aj  leaves • 

The  new 

solution 

ROW 

BASIS 

z 

*1 

*2 

Ei 

Table  16- 
A1 

14 

h. 

*1 

e3 

bl  ri 

(0) 

Z 

1 

0 

0 

-1-M  1+2M 

-1+M 

1 

0 

0 

7+2M 

( 1 ) 

E3 

0 

0 

0 

-1 

1 

1 

-1 

1 

-1 

2 

(2) 

*1 

0 

1 

0 

0 

0 

-1 

1 

0 

0 

2 

(3) 

*2 

0 

0 

1 

-1 

1 

1 

-1 

0 

0 

3 
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The  solution  is  still  not  optimal.  E^  is  the  entering  variable,  how¬ 
ever,  all  of  the  a^j  values  in  the  column  vector  for  Ex  are  <.0.  It  is  not 
possible  to  compute  a  positive  r^  for  any  row.  This  indicates  that 
bringing  Ej  into  the  solution  basis  will  not  drive  any  of  the  existing 
basic  variables  out.  The  solution  is  unbounded  (in  the  direction  of 
maximization) . 

Wo  Feasible  Solution 

Consider  again  the  problem: 

maximize:  Z  *  2Xi  +  X2  (0) 

s.t.  +  X2  <  4  (1) 

Xi  >5  (2) 

X2  >'6  (3) 

Rewriting  in  standard  form: 

maximize:  Z  -  2Xi  +  X2  +  OSj  +  0E2  -  MA2  +  OE3  -  MA3 

s.t.  Xi  +  X2  +  Si  «  4 

X!  -  E2  +  A2  -  5 

x2  -  Eg  +  Aj  =  6 

In  tableau  form: 

V)  Table  16-15 

ROW  BASIS  Z  X,  X,  S,  E,  A,  E,  A, 

(0)  1-2-1  0  0  M  0  M 

(1)  01110000 

(2)  0  10  0-1100 

(3)  0  0  1  0  0  0  -1  1 

To  obtain  an  initial  basic  feasible  solution,  create  a  zero  in  row  (0)  for 
columns  A2  and  A3: 

Table  16-16 

ROW  BASIS  Z  Xj  X2  Sj  E2  A2  E3  A3  b±  r± 

(0)  Z  1  -2-M  -1-M  0  M  0  M  0  -11M 

(1)  Sj  0  1  1  1  0  0  0  0  4  4 

(2)  A2  0  1  0  0  -1  1  0  0  5  5 

(3)  A3  0  0  1  0  0  0  -1  1  6  - 

The  solution  is  not  optimal.  X^  enters  and  Sj  leaves.  The  new  solution  is: 

Table  16-17 

ROW  BASIS  Z  Xx  X2  S!  E2  A2  E3  A3  b*  r* 

(0)  Z  1  0  1  2+M  M  0  M  0  8+7M 

( 1 )  Xj  0  1  1  1  0  0  0  0  4 

(2)  A2  0  0  -1  -1  -1  1  0  0  1 

(3)  A3  0  0  1  0  0  0  -1  1  6 
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Looking  at  row  (0),  all  entries  are  >  0.  Hie  solution  is  optimal. 
However,  the  solution  basis  includes  artificial  variables  having  positive 
values.  Whenever  an  artificial  variable  appears  in  the  optimal  solution 
basis  at  a  strictly  positive  level,  there  is  no  feasible  solution.  If  an 
artificial  variable  appears  in  the  optimal  solution  basis  and  is  equal  to 
zero,  the  solution  is  feasible,  although  the  model  may  have  a  redundant 
constraint  or  basic  feasible  solutions  which  are  degenerate. 

Degeneracy 

As  previously  defined,  a  solution  is  described  as  degenerate  when  one 
or  more  of  the  basic  variables  (in  addition  to  the  n  nonbasic  variables 
arbitrarily  set  equal  to  zero)  equal  zero.  In  the  simplex  procedure,  this 
condition  is  indicated  when  there  is  a  tie  for  the  leaving  variable,  i.e., 
as  the  entering  variable  value  is  increased,  two  (or  more)  basic  variables 
are  driven  to  zero  and  out  of  the  basis  simultaneously  (two  or  more  basic 
variables  have  the  same  value).  When  this  occurs,  the  tie  is  broken  by 
arbitrarily  selecting  the  leaving  variable.  (Note:  If  one  of  the 
variables  is  artificial  and  the  other  is  not,  you  should  select  the  artifi¬ 
cial  variable.)  The  only  real  problem  with  degeneracy  is  that  in  very  rare 
situations,  the  simplex  algorithm  "cycles"  and  "stalls  out,"  i.e.,  it 
returns  to  a  previously  generated  nonoptimal  solution  rather  than  toward 
the  optimal  solution.  This  condition  cam  be  corrected,  in  some  cases,  by 
simply  selecting  another  one  of  the  tied  variables.  When  this  remedy  does 
not  correct  the  situation,  more  advanced  procedures,  not  described  here, 
are  available. 
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In  constructing  an  LP  model  £or  a  decision  system,  we  assume  that  all 
system  relationships  are  linear  and  that  the  model  parameters  (cj,  b£,  and 
a^j)  are  constant  and  known  with  certainty.  Obviously,  the  solution  of  an 
LP  problem  can  be  no  better  than  the  quality  (validity)  of  the  assumptions 
upon  which  the  model  is  based.  It  is  inconsistent  (and  very  risky!)  to 
place  a  high  degree  of  confidence  in  a  decision  policy  which  has  been 
derived  from  a  system  model  for  which  the  parameters,  components,  and  rela¬ 
tionships  are  not  known  with  equal  confidence.  Having  obtained  an  optimal 
solution  to  an  LP  model,  the  analyst  needs  to  investigate  the  "sensitivity" 
of  this  optimal  policy  to  changes  in  assumed  relationships  and,  in  par¬ 
ticular,  to  changes  in  values  assumed  for  the  respective  model  parameters. 

If  the  solution  is  relatively  "insensitive"  to  variation  in  model  rela¬ 
tionships  and  parameters,  then  the  analyst  might  feel  reasonably  confident 
in  the  optimal  solution  -  knowing  that  variations  in  the  model  assumptions 
do  not  significantly  affect  the  solution  policy  and  criterion  value.  On 
the  other  hand,  if  postoptimality  analysis  reveals  that  the  optimal  solu¬ 
tion  is  sensitive  and  varies  significantly  with  relatively  minor  variations 
in  the  model  assumptions,  the  decision  taker  should  proceed  with  caution. 
When  the  solution  is  sensitive  and  the  decision  is  important  (in  terns  of 
mission  performance  effectiveness,  cost,  safety,  etc.)  it  would  probab' y  be 
prudent  to  acquire,  if  possible,  the  additional  information  necessary  to 
improve  our  knowledge  of  the  actual  value  of  system  model  parameters  and 
system  relationships.  Of  course,  knowing  when  the  model  is  "good  enough” 
and,  r^latedly,  how  confident  one  needs  to  be  in  the  quality  of  the  model- 
derived  decision  policy  before  committing  to  action  is,  ultimately,  a  func¬ 
tion  of  the  decision  taker's  skill,  experience,  propensity  for,  or  adversion 
to,  risk,  and  value  system. 


This  discussion  focuses  primarily  on  parameter  sensitivity.  However, 
the  analyst  should  also  (at  least)  consider  the  sensitivity  of  the  optimal 
policy  to  adding  or  deleting  system  model  components  (variables)  and/or 
relationships  (constraints).  Also,  the  sensitivity  analysis  techniques 
presented  in  this  section  are  based  on  varying  one  parameter  at  a  time. 
Certainly,  the  simultaneous  variation  of  more  than  a  single  parameter  is 
more  realistic.  However ,  the  methodology  for  such  "multivariate"  sen¬ 
sitivity  analysis  is  rather  complex  and  beyond  the  scope  of  this 
discussion.  For  a  more  complete  and  detailed  treatment  of  postoptimality 
(sensitivity),  you  should  refer  to  one  of  the  references  following  this 
section. 


As  the  name  suggests,  postoptimality  analysis  begins  after  the  optimal 
basic  feasible  solution  to  the  decision  system  model  has  been  detezmined, 
i.e«,  we  begin  with  the  solution  represented  by  the  final  table  or  tableau. 
To  illustrate  this  discussion,  consider  the  following  example: 


maximize:  Z  “  6X^  +  3^ 


5%1  +  6X2  £  30 

2XX  +  Xg  -  8 

Xx  +  i  1 

x1 ,  x2  >  0 


To  solve  this  model,  convert  to  standard  form: 


maximi  '«* 

:  Z  -  6XX  + 

*2  +  °h  - 

M*2  +  1 

OE3  - 

MA3 

(0) 

Sets 

5Xi  + 

6X2  ♦  Si 

-  30 

(1) 

2Xj^  + 

*2  +  *2 

-  8 

(2) 

*1 

-  E3  +  a3 

-  1 

(3) 

Xi,  x2 

In  tabular  form: 

Table 

16-18 

ROW 

BASIS 

Z 

*1 

X2  S! 

*2 

*3 

*3 

bl 

(0) 

1 

-6 

-1  0 

M 

0 

M 

0 

(1) 

0 

5 

6  1 

0 

0 

0 

30 

(2) 

0 

2 

1  0 

1 

0 

0 

8 

(3) 

0 

1 

0  0 

0 

-1 

1 

1 

To  develop  an  initial 

basic 

feasible  solution. 

eliminate 

the  -M* 

1  values 

from  row 

(0): 

Table 
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ROW 

BASIS 

Z 

*1 

*2  Si 

a2 

®3 

*3 

bi 

*i 

(0) 

Z 

1  ■ 

-6-3M 

-1-M  0 

0 

M 

0 

-9M 

(1) 

Si 

0 

5 

6  1 

0 

0 

0 

30 

6 

(2) 

*2 

0 

2 

1  0 

1 

0 

0 

8 

4 

(3) 

A3 

0 

1 

0  0 

0 

-1 

1 

1 

1 

The  solution  is 

not  optimal. 

Xi  enters 

and  A3 

leaves. 

The  new 

solution 

Table 

16-20 

ROW 

BASIS 

Z 

*1 

*2  Si 

A2 

S3 

a3 

bi 

*i 

(0) 

Z 

1 

0 

-1-M  0 

0  - 

6-2M 

6+3M 

6-6M 

(1) 

S1 

'  0 

0 

6  1 

0 

5 

-5 

25 

5 

(2) 

A2 

0 

0 

1  0 

1 

2 

-2 

6 

3 

(3) 

*1 

0 

1 

0  0 

0 

-1 

1 

1 

- 

The  solution  is 

not  optimal. 

S3  enters 

and  A2 

leaves. 

The  new 

solution 

Table 
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ROW 

BASIS 

z 

X1 

X2  $! 

a2 

*3 

a3 

bi 

*i 

(0) 

Z 

1 

0 

2  0 

3+M 

0 

M 

24 

(1) 

81 

0 

0 

7/2  1 

-5/2 

0 

0 

10 

(2) 

*3 

0 

0 

1/2  0 

1/2 

1 

-1 

3 

(3) 

*1 

0 

1 

1/2  0 

1/2 

0 

0 

4 

16-36 


The  solution  is  optimal.  This  final  tableau  contains  a  wealth  of  infor¬ 
mation  for  the  manager.  By  inspection,  you  can  see  that  the  optimal  policy 
is  -  4,  and  X2  ”  0,  resulting  in  Z  *  24.  The  final  table  also  shows 
that,  since  Sj.  ■  10,  only  20  units  of  the  30  units  of  resource  (1)  origi¬ 
nally  available  were  consumed.  The  slack  of  10  means  these  resources  are 
available  for  use  by  the  manager  elsewhere.  Similarly,  the  final  table 
also  indicates  E3  »  3,  i.e.,  in  the  optimal  solution,  3  more  units  than  are 
required  by  constraint  (3)  are  produced.  Additional  information  about  the 
sensitivity  of  the  optimal  solution  policy  to  variations  in  the  assumed 
value  of  the  respective  model  parameters  (Cj,  b^,  a^j)  can  be  obtained 
through  analysis  of  the  final  (optimal)  table. 

Coefficients  of  the  Objective  Function  (Cj  > 

In  performing  sensitivity  analysis  on  the  coefficients  in  the  objective 
function  (Cj),  the  general  objective  is  to  determine  the  range  over  which 
the  respective  variable  coefficient  can  vary  without  changing  the  optimal 
solution.  The  specific  analysis  procedure  used  depends  on  whether  we  are 
considering  the  coefficient  of  a  nonbasic  or  basic  variable. 

Nonbasic  Variables 

Recall  that  the  simplex  procedure  terminates  when  the  test  for  optimality 
indicates  that  all  of  the  coefficients  in  row  (0)  are  £  0.  From  the  table 
corresponding  to  the  optimal  solution  in  the  example ,  you  can  see  by 
inspection  that  X^  is  a  basic  variable  while  X2  is  nonbasic.  Over  what 
range  (AC2)  would  C2  have  to  vary  before  X2  becomes  a  basic  variable, 
changing  the  optimal  solution?  In  order  for  X2  to  enter,  its  corresponding 
coefficient  would  have  to  be  negative ,  indicating  the  basis  is  not  optimal. 
Introducing  Ac2  into  the  table: 


ROW 

BASIS 

z 

Xl 

Table 
*2  Si 

16-22 

*2 

e3 

a3 

bi 

(0) 

Z 

1 

0 

2-AC2  0 

3+M 

0 

H 

24 

(1) 

si 

0 

0 

7/2  1 

-5/2 

0 

0 

10 

(2) 

E3 

0 

0 

1/2  0 

1/2 

1 

-1 

3 

(3) 

XI 

0 

1 

1/2  0 

1/2 

0 

0 

4 

He  can  write  2  -  AC2  >0  or,  alternatively,  AC2  <  2.  Since,  in  the  original 
problem,  C2  *  1,  then  C2  +  AC2  <1+2  or  <3.  This  means  that  the  current 
solution  basis  will  remain  optimal  as  long  as  C2  <  3.  Note  that  there  is 
no  lower  limit  to  this  range.  The  range  over  which  the  objective  function 
coefficient  of  a  nonbasic  decision  variable  can  vary  without  changing  the 
optimal  solution  is  called  the  range  of  insignificance. 


Basic  Variables 

You  perhaps  noted  that  introducing  a  "A"  term  in  the  table  for  a  nonbasic 
variable  had  no  effect  on  the  values  of  the  basic  variables  or  on  the  value 
of  the  criterion  variable  Z.  Such  is  not  the  case  when  we  attempt  sensitivity 


analysis  on  tha  objective  function  coefficient  for  a  basic  decision 
variable*  Beginning  in  the  sane  manner  as  we  did  with  the  nonbasic 
variable/  a  "A”  is  introduced  in  row  (0)  for  the  basic  variable  being 
considered  (e*g*/  Xj): 

Table  16-23 

BOH  BASIS  Z  Xj  X2  Sj  A2  E3  A3  b± 

(0)  Z  1  0-ACj  2  0  3+M  0  M  24 

(1)  Sj  0  0  7/2  1  -5/2  0  0  10 

(2)  Eg  0  0  1/2  0  1/2  1  -1  3 

(3)  Xj  0  11/201/2004 

Notice,  however,  that  by  introducing  the  AC^  tern,  we  no  longer  have  the 
required  identity  matrix*  The  matrix  needs  to  be  manipulated  to  reestablish 
the  identity  condition.  The  revised  table  follows: 

Table  16-24 


BOH 

BASIS 

z 

*1 

x2  SX 

A2 

e3 

a3 

bi  ri 

(0) 

Z 

1 

0 

2+1/2AC2  0 

3+M+1/2AC2 

0 

M 

24+4 AC 2 

(  1) 

0 

0 

7/2  1 

5/2 

0 

0 

10 

(2) 

0 

0 

1/2  0 

1/2 

1 

-1 

3 

(3) 

4 

0 

1 

1/2  0 

1/2 

0 

0 

4 

In  the  process  of  eliminating  the  Ac^  term  from  the  column,  it  was 
introduced  in  the  X2,  A2,  and  b^  columns.  As  in  the  case  of  the  nonbasic 
variable,  we  can  now  write  the  conditions  for  preserving  the  current  solu¬ 
tion  basis: 

For  X2:  2+1/2Ar  >  0 

1/2ACJ  >  -2 

ACi  *  -4 

For  A2:  3+M+1/2ACj  >  0 

1/2 AC j  >  -3-M 
AC x  >  -6-2M 

Since  the  condition  that  AC}  >  -4  also  insures  compliance  with  the 
second  condition,  i*e.,  ACj  >  -6-2M,  this  second  constraint  is  redundant. 
Consequently,  since  Cj  ■  6  in  the  model: 

Ci  ♦  Aci  >6-4  or  Ci  +  ACi  >  2 

Therefore,  as  long  as  the  value  of  Cj  remains  >  2,  the  optimal  solution 
basis  will  not  change.  However,  because  we  are  dealing  with  a  basic 
variable  and  Z  is  now  equal  to  24  +  4AC2 ,  varying  the  value  of  Ci  will  be 
reflected  in  the  value  of  Z.  Since  AC}  >  -4,  Z  could  vary  as  low  as 
24  +  4(-4)  ■  24  -  16  ■  8,  without  reflecting  a  change  in  the  optimal  solu¬ 
tion  basis .  The  range  over  which  the  objective  function  coefficient  of  a 
basic  variable  can  vary  without  chancing  the  optimal  solution  basis  is 


V 


Boundary  Value  Constants  (b^) 

In  conducting  a  sensitivity  analysis  on  the  boundary  value  or  right-hand- 
side  (b^)  constants ,  the  objective  is  to  determine  the  range  over  which  the 
b^  constant  can  vary  without  changing  the  optimal  solution  basis. 


Let's  consider  the  first  constraint:  5X^  +  6X2  5  30.  He  would  like  to 


know  the  range  on  b  such  that  the  optimal  solution  remains  unchanged. 
For  purposes  of  illustration,  let's  rework  the  example  problem,  replacing 
b  with  b,  +  Ab. 

Table  16-25 


ROW 

(0) 

(1) 

(2) 

(3) 


r 

BASIS 


Z 

1 

0 

0 

0 


-6 

5 

2 

1 


x2 

-1 

6 

1 

0 


0 

1 

0 

0 


M 

0 

1 

0 


■S3 

0 

0 

0 

-1 


AS 

M 

0 

0 

1 


bi 

0 

30+^b^ 

8 

1 


Clearing  out  the  "M"  values  from  row  (0)  to  create  the  identity  matrix: 


ROW 

(0) 

(1) 

(2) 

(3) 


BASIS 

Z 

S, 


z 

1 

0 

0 

0 


X1 

-6-3M 

5 

2 

1 


Table  16-26 
x2  S1  a2 


-1-M 

6 

1 

0 


0 

1 

0 

0 


0 

0 

1 

0 


E3 

M 

0 

0 

-1 


A3  bi 

°  "9a  Ab, 

0  30+Abi  6+~5i 

0  8  4 

1  1  1 


The  solution  is  not  optimal.  X^  enters,  A ^  leaves.  The  new  solution  is: 


Table  16-27 


ROW  BASIS 

Z 

X 

1  X2 

S1 

A2 

e3 

a3 

bi 

5i 

(0)  Z 

1 

0 

-1-M 

0 

0 

-6-2M 

6+3M 

6-6M 

Ab, 

5+-g- 

(i)  sx 

0 

0 

6 

1 

0 

5 

-5 

25+Abi 

(2)  A2 

0 

0 

1 

0 

1 

2 

-2 

6 

3  * 

(3)  X1 

0 

1 

0 

0 

0 

-1 

1 

1 

- 

1  solution  is  not  optimal.  E§  enters  and  A2 

leaves. 

The  new  solution  is 

Table  16-28 

ROW  BASIS  Z 

*1 

x2 

*2 

=  3 

*3 

bi 

*i 

(0)  Z  1 

0 

2  0 

3+M 

0 

M 

24 

(1)  31  0 

0 

7/2  1 

-5/2 

0 

0 

10+ Ab, 

(2)  E3  0 

0 

1/2  0 

1/2 

1 

-1 

3 

(3)  X1  0 

1 

1/2  0 

1/2 

0 

0 

4 

The  solution  is  optimal.  It  is  not  necessary  to  completely  rework  the 
problem  as  was  just  done  for  purposes  of  illustration.  Since,  in  the  pro¬ 
cess  of  solving  this  problem,  the  same  algebraic  row  operations  were  per¬ 
formed  on  both  the  left-hand  and  right-hand  sides  of  the  equation,  the  Abi 
terms  in  column  bj  will  have  exactly  the  same  coefficients  as  the  vector  S} 
(Sj  being  the  slack  associated  with  constraint  (1)  and,  therefore,  bj): 
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Table 

16-29 

ROW 

BASIS 

Z 

X1 

X, 

S1 

*2 

®3 

A3 

bi 

(0) 

Z 

1 

a 

\ 

0 

3+H 

0 

N 

24+0 A». 

(1) 

S1 

0 

0 

7/2 

1 

-5/2 

0 

0 

10+IAbj 

(2) 

=3 

0 

0 

1/2 

0 

1/2 

1 

-1 

3+0 Ab^ 

(3) 

*1 

0 

1 

1/2 

0 

1/2 

0 

0 

4+0Ab. 

Now,  recalling  that  the  basic  variables  will  remain  in  the  solution 
basis  as  long  as  they  are  >0,  we  can  write:  10  +  Abj  >  0  or  Abj  >  *>10 
since  bj  *  30  in  the  original  problem,  as  long  as  bj  >  (bj  +  Abj )  > 

(30  -  10)  >  20,  the  solution  basis  will  remain  unchanged.  Now,  consider 
the  second  constraint:  2Xj  +  X2  -  a.  In  this  case,  we  introduced  the 
artificial  variable  A2  to  obtain  an  initial  identity  matrix.  Consequently, 
the  coefficients  appearing  under  Aj  in  the  optimal  solution  are  the  ones  to 
use  as  coefficients  for  the  Ab2  term  under  the  b^  column.  Recognizing 
this,  you  can  write: 


(0) 

24  + 

(3  +  M)Abj 

(1) 

10  + 

(-5/2)Ab2  >0 

(2) 

3  + 

1/2  Ab2  >0 

(3) 

4  + 

1/2  Ab2  >0 

From  ( 1 ) :  Ab2  <  4 
From  (2):  Ab2  >-6 
From  (3):  Ab2  >-8 

Combining  these:  -6  <  Ab2  <  4 

Now,  since  bj  «  8:  2  <  b2  <  12 

Note  that  as  long  as  remains  in  the  range  2  <  b^  <  12  (or  alternatively, 
-6  <  Abg  <  4)  the  optimal  solution  basis  remains  unchanged.  However,  the 
value  of  these  basic  variables  can  change  as  b^  varies.  Suppose,  for 
example,  t>2  is  changed  to  10  (Ab^  ■  2).  We  can  assess  the  effect  of  this 
change  without  resolving  the  problem: 


(0) 

Z 

24 

+ 

3(2) 

(1) 

S  « 

10 

- 

5/2(2) 

(2) 

B  - 

3 

+ 

1/2(2) 

(3) 

X  » 

4 

+ 

1/2(2) 

*Note:  the  "M"  term  in  the  coefficient  (3  +  M)  is  ignored. 
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If,  however ,  we  want  to  assesa  the  effect  of  Abj  “  this  exceeds  the 
previously  determined  range  and  the  problem  must  be  resolved.  Finally, 
consider  the  third  constraint:  Xj  1  1.  Here,  too,  an  artificial  variable 
(A3)  was  used  in  the  initial  solution  basis.  The  coefficients  under  A a  in 
the  final  tableau  are  the  coefficients  to  use  for  the  coefficients  of  *3 
in  column  b^t 


(0) 

24  +  MAb3 

(1) 

10  + 

0Ab 

>  0 

(2) 

3  - 

1  Ab 

>  0 

(3) 

4  + 

oo> 

>  0 

From  (2): 

Ab  3  < 

3 

Therefore,  since  b3 

*  1, 

*>3 

The  range  over  which  a  right-hand  constant  (b^)  can  vary  without  changing 
the  optimal  solution  basis  is  called  the  range  of  feasibility. 

Shadow  Prices 

The  concept  of  the  range  of  feasibility  for  b£  suggests  a  very  important 
related  notion  -  tb-  "shadow  price."  The  solution  to  every  LP  model  is 
determined  by  the  intersection  of  two  or  more  limiting  or  binding 
’jQf  constraints,  i.e.,  constraints  for  which  all  of  the  available  resource  (b*) 

is  consumed  in  the  optimal  solution  or  for  which  demand  is  satisfied 
without  surplus.  In  this  case,  the  ability  to  improve  Z  even  further  is 
constrained.  As  a  manager,  it  is  of  interest  to  know  how  much  it  is  worth 
(in  terms  of  Z)  to  have  an  additional  unit  of  the  constraining  resource  or 
requirement.  This  value  is  called  the  shadow  price  for  the  ith  constraint. 
The  shadow  price  can  be  read  directly  from  the  final  (optimal)  solution 
table.  The  shadow  price  associated  with  the  ith  constraint  is  the  coef¬ 
ficient  in  row  (0)  underneath  the  associated  slack  variable  for  5 
constraints  and  the  pure  number  (ignoring  "M"  terms)  for  the  artificial 
variable  associated  with  (■)  or  (£)  constraints.  The  shadow  price  holds 
throughout  the  range  of  feasibility  for  that  constraint.  Consider,  for 
example,  the  final  tableau: 

Table  16-30 


HOW 

BASIS 

Z 

*1 

*2 

Si 

S2 

*3 

*>i 

(0) 

Z 

1 

0 

0 

2 

0 

3+M 

38 

(1) 

X1 

0 

1 

0 

1 

0 

-1 

4 

(2) 

s2 

0 

0 

0 

1 

1 

-3 

2 

(3) 

x2 

0 

0 

1 

-1 

0 

2 

2 

The  shadow  price  for  constraint  (1)  -  2.  The  range  of  feasibility  for  this 
constraint  and  shadow  price  is: 
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Abj  >  -4 
Abi  >  -2 
Ab  j  <  2 


(0) 

38 

+ 

2Ab1 

(1) 

4 

+ 

14b} 

(2) 

2 

+ 

ia>x 

(3) 

2 

+ 

1&>! 

-2  <  Ab  <2 


For  constraint  ( 2) ,  the  shadow  price  is  0  (fs  you  would  expect  because 
Sg  ”  2,  i.e.,  we  currently  have  2  excess  units  of  commodity  (2),  so  why  buy 
any  more?).  The  range  of  feasibility  for  (2)  is: 


(0)  38  +  0Ab2 

(1)  4  +  0Ab2  >  0 

(2)  2  +  1Ab2  >0  Ab2  >  -2 

(3)  2  +  0Ab2  >  0 

For  constraint  ( 3 ) ,  the  shadow  price  is  3 ,  corresponding  to  the  pure 
number  portion  of  the  quantity  (3  +  M).  The  corresponding  range  j>f  feasi¬ 
bility  is: 


(0) 

(1) 

(2) 

(3) 


38  +  3Ab3 
4  -  1 Abg  >  0 
2  -  3Ab3  >  0 
2  +  2Ab3  >  0 


Ab3  <  4 
Ab3<  2/3 
Ab3  >  -1 


1  <  Ab3  <  2/3 


Other  Postoptimalitv  Analyses 


In  addition  to  examining  the  ranges  of  insignificance,  optimality, 
feasibility,  there  are  other  types  of  postoptimality  analysis  that  can 
accomplished.  For  example ,  sensitivity  analysis  can  be  accomplished  on 
constraint  coefficients  (a^j ) .  This  analysis  effectively  seeks  to  deter¬ 
mine  the  sensitivity  of  the J optimal  solution  to  variation  in  the  slope  of 
the  respective  constraints.  Computationally,  this  analysis  is  somewhat 
more  difficult,  the  difficulty  depending  on  whether  basic  or  nonbasic 
variables  are  being  considered.  As  suggested  previously,  it  is  also 
possible  to  analytically  assess  the  sensitivity  of  the  optimal  solution  to 
the  simultaneous  variation  of  two  or  more  parameters,  as  well  as  to  examine 
the  sensitivity  of  the  solution  to  the  addition  or  deletion  of  constraints 
in  the  decision  system  model. 
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■  1i5 


To  summarize  and  reiterate,  mathematical  modeling,  in  general,  and  LP, 
in  particular,  is  a  process  which  includes  a  systematic  analysis  of  the 
decision  situation,  constructing  a  mathematical  model  that  adequately  cap¬ 
tures  the  essence  of  the  situation,  and  solving  the  model.  Equally  impor¬ 
tant,  the  process  also  includes  interpreting  the  model  solution  back  to  its 
parent  reality,  in  part  by  conducting  postoptimality  analysis  to  assess  the 
sensitivity  of  the  model-derived  optimal  policy  to  changes  in  model  parame¬ 
ters.  It  is  important  to  recognize  that  a  manager's  decision-taking  effec¬ 
tiveness  is  affected  to  a  large  extent  by  the  model  used  to  assess  the 
situation  and  to  derive  a  particular  course  of  action,  whether  that  model 
is  an  intuitive  mental  image  or  a  more  systematic  and  rigorous  formulation. 


The  following  ia  provided  to  illustrate  the  total  modeling  process,  i.e., 
system  analysis,  model  formulation,  model  solution,  and  postoptimality  sen¬ 
sitivity  analysis. 

Problem  Statement 


A  particular  aircraft  system  corrosion  control  solution  is  made  from  three 
basic  concentrated  ingredients:  X^,  X2,  and  X3.  A  batch  of  the  eolution  must 
include  at  least  20  ounces  of  the  concentrates,  but  not  more  than  40  ounces. 
Further,  for  each  ounce  of  Xj,  at  least  1/4  ounce  of  X3  must  be  used.  Similarly, 
at  least  1/2  ounce  of  X2  should  be  used  for  each  ounce  of  Xj.  The  costs  of  the 
concentrates  X3,  X2,  and  X3  are  $5,  $15,  and  $20  per  ounce,  respectively. 

System  Analysis 

System  Components: 

1,  Criterion  Variable  (Z ) ;  Total  cost  for  a  batch  of  solution  in 
dollars;  the  objective  is  to  minimize  Z, 

2,  Decision  Variables  (Xj):  Xj  represents  the  ounces  of  ingredient  j 
used  in  solution  (j  *  1,2,3) 

3,  Environmental  Variables  (Bl) 

B^:  total  ounces  of  concentrates  (20  £  B^  5  40) 

B2:  ratio  of  X^  to  X3  (B2  -  4) 

B3:  ratio  of  Xx  to  Xj  (Bj  <  2) 

System  Relationships 


Figure  16-16 


UP  Model 


minimize : 

Z  -  5XX  +  15X 

2  +  20x3 

s.t. 

xx  +  x2  + 

x3 

> 

20 

XX  +  X2  + 

x3 

< 

40 

X1 

4X3 

< 

0 

Xi  -  2X2 

< 

0 

Xx,  X 2»  X3  >  0 
IiP  Modal  Solution 
Standard  Form: 


maximize  (-Z)  -  -5X1  -  15  X2  -  20X3  -  0EL  -  MAj. 

-  0S2  “  OS3  -  OS4 

S.t.  X^  +  X2  +  X3  -  +  Ai  - 

X1  +  *2  +  X3  +  S2  - 

X£  -  4X3  +  S3 

*1  “  2X2  +  S4 

Xj_  1  x2 »  X3 ,  Ej_  1  Aj_  1  S^ ,  S2  /  S3  ,  S4  2 

Convert  to  tableau  form: 


Table  16-31 


ROW  BASIS 

Z 

X1 

x2 

x3 

*1 

M 

S2 

S3 

S4 

bi 

(0)  Z 

-1 

+5 

+15 

+20 

0 

M 

0 

0 

0 

0 

(1) 

0 

1 

1 

1 

-1 

1 

0 

0 

0 

20 

(2) 

0 

1 

1 

1 

0 

0 

1 

0 

0 

40 

(3) 

0 

1 

0 

-4 

0 

0 

0 

1 

0 

0 

(4) 

0 

1 

-2 

0 

0 

0 

0 

0 

1 

0 

To  create  an 
column  A],: 

initial 

basic  feasible  solution. 

Table  16-32 

eliminate 

M  from 

row 

(0) 

ROW 

BASIS 

Z 

X1 

x2 

x3 

Ei 

A1 

s2 

S3 

S4 

b 

(0) 

Z  -1 

5-M 

15-M 

20 -M 

M 

0 

0 

0 

0 

-2 

(1) 

A1 

0 

1 

1 

1 

-1 

1 

0 

0 

0 

2 

(2) 

S2 

0 

1 

1 

1 

0 

0 

1 

0 

0 

4 

(3) 

S3 

0 

© 

0 

-4 

0 

0 

0 

1 

0 

(4) 

S4 

0 

1 

-2 

0 

0 

0 

0 

0 

1 

20  (1) 

40  (2) 

0  (3) 

0  (4) 

0 


Solution  is  not  optimal 
X^  enters 

Tie  for  the  leaving  variable  between  S3  and  S4;  arbitrarily  select  S3 


Table  16-33 


ROW 

BASIS 

z 

X1 

X2 

x3 

X1 

Ai 

s2 

S3 

s4 

bi 

*i 

(0) 

Z 

-1 

0 

15-M 

40-5M 

M 

0 

0 

-5+M 

0 

-20M 

(1) 

A1 

0 

0 

1 

5 

-1 

1 

0 

-1 

0 

20 

4 

(2) 

s2 

0 

0 

1 

5 

0 

0 

1 

0 

0 

40 

8 

(3) 

X1 

0 

1 

0 

-4 

0 

0 

0 

1 

0 

0 

- 

(4) 

S4 

0 

0 

-2 

4 

0 

0 

0 

1 

1 

0 

0 

Solution  is  not  optimal 
X3  enters 
S4  leaves 


Table  16-34 


ROW 

BASIS 

z 

X1 

x2 

35 

*3 

Ei 

A1 

s2  S3 

-15 

S  4 
-10 

bi 

*i 

(0) 

Z 

-1 

0 

-7/2M 

0 

M 

0 

0  +9/4M 

+5/4M 

-20M 

(1) 

A1 

0 

0 

7/2 

0 

-1 

1 

0  -9/4 

-5/4 

20 

40/7 

(2) 

s2 

0 

0 

7/2 

0 

0 

0 

1  -5/4 

-5/4 

40 

80/7 

(3) 

X1 

0 

1 

-2 

0 

0 

0 

0  2 

1 

0 

(4) 

*3 

0 

0 

“1/2 

1 

0 

0 

0  1/4 

1/4 

0 

Solution  is  not  optimal 
X2  enters 
A 1  leaves 

wr 

Table  16-35 


ROW 

BASIS 

z 

X1 

x2 

X3 

Ei 

A1 

s2 

S3 

S4 

bi 

(0) 

Z 

-1 

0 

0 

0 

10 

-10+M 

0 

15/2 

5/2 

-200 

(1) 

x2 

0 

0 

1 

0 

-2/7 

2/7 

0 

-9/14 

-5/14 

40/7 

(2) 

s2 

0 

0 

0 

0 

1 

-1 

1 

1 

0 

20 

(3) 

X1 

0 

1 

0 

0 

-4/7 

4/7 

0 

5/7 

2/7 

80/7 

(4) 

X3 

0 

0 

0 

1 

“1/7 

1/7 

0 

-1/14 

1/14 

20/7 

Solution  is  optimal 
Xi  -  80/7  Z  *  200 
X2  -  40/7 
X3  -  20/7 


Nonbasic  Variables!  (none) 
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Basic  Variables  (X^  X2,  ) : 

=  5  10  -  4/7ACi 

-10  +  M  +  4/7AC! 
15/2  +  5/7  A Ci 
5/2  +  2/7  A Ci 

-  35/4 

-  15/4 


C2  -  15  10  -  2/7Ac2 

-10  +  M  +  2/7Ac2 
15/2  -  9/14  Ac2 
5/2  -  5/14  Ac2 
Ac2 
Cl 


C3  =20  10  -  1/7AC3 

-10  +  M  +  1/7AC3 
15/2  -  1/14  Ac3 
5/2  +  1/14  Ac3 

-  35  <  Ac, 

-  15  <  ^ 


bi  =  20  40/7  +  2/7 Abi 

20  -  Abi 
80/7  +  4/7 Abi 
20/7  +  1/7 Abi 
-20  <  Abx 

0  <  bi 


bz  =  40  40/7  +  0  Ab2 

20+1  Ab2 
80/7  +  0  Ab2 
20/7  +  0  Ab2 
Ab2 
>2 


b3  ■  0  40/7  -  9/14Ab3 

20  +  lAb3 

80/7  +  5/7  Ab3 
20/7  -  1/14Ab3 
-16  <  Ab3 
-16  <  b. 


>  0 

A  Ci 

< 

35/2 

>  0 

A  Ci 

> 

7/4 { -M+ 10) 

>  0 

A  Ci 

>_ 

21/2 

>  0 

A  Ci 

>  _ 

35/4 

A Ci  <  35/2 

Ci  <  90/4 

>  0 

ac2 

< 

35 

>  0 

ac2 

> 

7/2  ( 10— M) 

>  0 

ac2 

< 

35/3 

>  0 
<  7/3 

ac2 

< 

7/3 

<  17  1/3 

>  0 

AC3 

< 

70 

>  0 

ac3 

> 

7( 10-M) 

>  0 

AC3 

< 

105 

>  0 

ac3 

>. 

-35 

<  70 

<  90 

> 

0 

Abi  >  “20 

> 

0 

Abi  <  20 

> 

0 

Abi  >  ~20 

> 

0 

Abi  >  ~20 

<  20 
<  40 


>  0 

>  0  Ab2  >  -20 

>  0 
>  0 
>-20 
>  20 


>  0  Ab3  K  80/9 

>  0  Ab3  >  -20 

>  0  Ab3  >  -16 

>  0  Ab3  <  40 

<  80/9 

<  80/9 


b4  -  0 


40/7  -  5/14Ab4  >  0 
20  +  0&b4  >  0 

80/7  +  2/7Ab4  >  0 

20/7  -*■  1/14Ab4  >  0 
-40  <  Ab4  <  16 
-40  <  b  <  16 


Ab4  <  16 

Ab4  >  -40 
Ab4  >  -40 


SPECIALIZED  LP  ALGORITHMS 


Linear  programing  has  achieved  its  popularity  in  part  because  of  its 
wide  applicability*  Certain  classes  of  LP  problems  have  specialized  structures 
which  permit  solution  by  algorithms  which  are  computationally  more  efficient 
than  the  more  generally  applicable  simplex  method.  The  transportation  problem 
and  the  assignment  problem  are  included  in  this  section  because  of  their 
utility  and  applicability  in  commonly-encountered  managerial  decision¬ 
taking  situations* 

The  Transportation  Model 

The  transportation  model  refers  to  that  class  of  decision  situations 
in  which  some  commodity  is  available  in  specified  limited  quantities  at  a 
number  of  supply  points  (origins)  and  is  required  in  other  specified 
amounts  at  a  number  of  demand  points  (destinations).  The  model  assumes 
that  there  is  no  difference  in  the  quality  of  the  commodity  available  at 
each  supply  point*  The  objective  is  to  allocate  the  available  supply  to 
meet  all  demand  requirements  in  such  a  manner  as  to*  typically,  minimize 
the  total  transportation  cost  or  to  maximize  total  profit  associated  with 
the  allocation. 

As  with  many  other  management  science  models,  the  transportation  model 
takes  its  name  from  the  prototype  problem  for  which  the  model  was  deve¬ 
loped.  However,  the  structure  of  the  classical  transportation  problem 
applies  to  many  decision  situations  having  nothing  to  do  with  the  transpor¬ 
tation  or  shipment  of  commodities.  Because  of  this  flexibility,  the 
transportation  model  and,  relatedly,  the  assignment  model  are  often 
described  by  the  more  generic  label:  distribution  models  or  systems. 

General  System  Analysis 

System  Components: 

1.  Criterion  Variable  (Z):  Total  cost  to  distribute  a  particular 
commodity  from  supply  points  i(  i*  1,2,...,m)  to  demand  points  j  (j*1,2, . . .  ,n) . 

2.  Decision  Variable  (X<j):  *  ij  represents  the  units  of  commodity 

distributed  from  source  i  to  demand  point  j,  where  i  •  1,2,...,m  and 

j  *  1,2, . • .  ,n 

3.  Environmental  Variables: 

S j.  -  amount  of  commodity  allocated  from  origin  point  (Si  -  si) 

Dj  •  amount  of  commodity  allocated  to  destination  point  (Dj  “  dj) 

S  ”  total  supply  available  at  all  sources 
D  ”  total  demand  required  at  all  destinations 
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Mathematical  Formulation 


n  n 

minimize:  Z  -  £  £  c,jX 


s  •  t  • 


i-1  j-1  « U 

n 

l  X,  «  S.  (i  -  1,2, ...,m) 
j-1  1 

m 

I  Xi;j  -  Dj  ( j  -  1,2,...  n) 

m  n 

I  Si-  I  Dj 
i-1  j-1 


^j>  0 

Note:  Refer  to  page  16-58  for  procedures  to  be  followed  in  the  more 

general  case  when  total  supply  does  not  equal  total  demand,  i.e.,  when 


l  st/ I,Dj 

i-1  j-1 


Formulation  Examples 


Example  16-4,  is  an  example  of  a  "classical"  transportation  problem 
involving  the  physical  distribution  of  a  oonsaodity  from  supply  points  to 
demand  points* 


EXAMPLE  16-5 


The  Base  Civil  Engineer  at  Grand  Forks  A7B  is  planning  the  exterior 
painting  work  that  must  be  done  during  the  short  susner  period  when  this 
type  of  work  can  be  satisfactorily  accomplished .  The  planners,  together 
with  the  foreman  of  the  painting  shop,  have  estimated  the  following 
requirements : 


Base  Facilities  7,800  manhours 

Missile  Sites  3,200  " 

Family  Housing  9,600  " 

Labor  is  available  from  three  sources: 

In-House:  13,000  manhours  $8 /manhour 

Overhires:  4,160  "  $6/manhour 

Contract:  3,440  "  $ 10/manhour 

Because  of  economic  and  security  reasons,  it  is  not  practical  to  use 
contract  labor  on  the  missile  site  work.  What  manpower  policy  would  you 
recommend? 


System  Analysis 


System  Components; 


1.  Criterion  Variable  ( 2) :  Total  labor  cost  in  dollars  to 
accomplish  exterior  painting  requirements;  the  objective  is  to  minimize 
total  cost. 


2*  Decision  Variable  .) :  Manhours  from  source  i  used  on  task 
j,  at  a  unit  cost  of  C^j,  where:  J 


i« 

Cij 

j 

In-house 

8 

1: 

Base 

Overhire 

6 

2: 

Missile 

Contract 

10 

3: 

Housing 

3.  Environmental  Variables: 

S,L  -  manhours  of  in-house  labor  used  (S^ 

3 2  *  manhours  of  overhire  labor  used  (s2 

S,  -  manhours  of  contracted  labor  used  (S„ 

J  3 

S  “  total  manhours  used  from  all  sources  (S 

D^  *  total  manhours  used  for  base  facilities  (D^  -  7,800) 

D2  -  total  manhours  used  for  missile  sites  (D2  -  3,200) 

D3  -  total  manhours  used  for  family  housing  (D3  »  9,600) 

D  -  total  manhours  used  for  all  requirements  (D  -  20,600) 


-  13,000) 

-  4,160) 

-  3,440) 
-  20,600) 
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Mathematical  (LP)  Modal 


minimize :  Z 


EXAMPLE  16-6 


axii  ♦ 

®Cl2+8Xl3  + 

6*21  +  6X22  +  6X23  +  10X31  +  10X33 

*11  + 

*12  +  *13 

•  13,000 

*21  * 

*22  +  *23 

-  4,160 

*31 

*  *33 

-  9,600 

xn  ♦ 

*21  *  *31 

-  7,800 

*12  + 

*22 

-  3,200 

*13  + 

*23  +  *33 

-  9,600 

Xtj  >0  (all 

i  and  j) 

You  are  the  chief  of  a  small  engineering  section  primarily  responsible 
for  research  and  development  on  avionics  systems.  Currently,  you  have  a 
staff  of  five  design  engineers  whose  experience  and  technical  expertise  vary. 
During  the  next  three  months,  your  office  must  complete  five  independent 
projects,  each  of  which  will  consume  the  efforts  of  one  of  your  staff  engi¬ 
neers.  In  attempting  to  determine  how  the  projects  should  be  assigned,  you 
have  estimated,  on  a  scale  from  one  to  ten,  the  efficiency  (utility)  of 
assigning  a  particular  engineer  to  a  specific  project.  These  estimates  are 
based  on  your  assessment  of  the  nature  of  each  project  (i.e.,  expertise 
required)  and  the  skills  and  experience  of  your  staff.  The  following  table 
summarizes  these  estimates: 


Table  16-36 
PROJECT 


1 

2 

3 

4 

5 

1 

6 

8 

9 

6 

2 

2 

8 

6 

2 

3 

3 

3 

6 

5 

8 

7 

6 

4 

8 

5 

6 

3 

1 

5 

8 

6 

5 

6 

0 

How  should  the  engineer-project  assignments  be  made  to  maximize  total  utility? 
System  Analysis 

System  Components: 

1.  Criterion  Variable  (Z):  Total  utility,  maximize  Z. 

2.  Decision  Variable  (X^j):  Xy  *  1  if  engineer  i  is  assigned  to 
project  j  (i,  j  -  1,2,.. .,5);  Xjj  *  0  ir  engineer  i  is  not  assigned  to  pro¬ 
ject  j. 

3.  Environmental  Variables: 

Sj_  ■  no.  of  projects  assigned  to  engineer  i  (S^  -  1),  i  -  1,2,..., 5 
Dj  “  no.  of  engineers  assigned  to  project  j  (Dj  •  1),  j  ■  1,2,..., 5 
SJ  “  Total  no.  of  engineers 
D  -  Total  no.  of  projects 
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Mathematical  (LF)  Modal 


maximize:  Z  ■  6X33  +  8X32  +  8X33  +  6*34  +  2X35 

+  8X21  +  6X22  +  2X23  +  3*24  +  3X25 

+  6X31  +  5*32  +  8X33  +  7*34  +  ®*3S 

+  8X43  +  3X42  +  8X43  +  3X44  +  IX45 

+  8X53  +  6X52  +  5X53  +  6X54  +  OX55 


8  e  t  e 


*11  +  * 
*21 


12 

*22 


+  x13 

+  *23 


*14 

*24 


+  X 


15 

*25 


X51  +  X52 


+  X 


53 


+  x54  +  X 


55 


*11  +  *21  +  *31  +  *41  +  *51  " 
*12  +  *22  +  *32  +  *42  +  *52  “ 

e 

*15  +  *25  +  *35  +  *45  +  *55  * 


*ij  -  0,1  (i,  j  -  1,2 . 5 


Solution  Techniques 


Because  the  transportation  model  is  a  special  case  of  the  more  general 
If  model,  the  simplex  algorithm  can  be  used  to  solve  the  transportation 
problem.  In  fact,  conceptually,  the  steps  in  the  transportation  method 
(alternatively  called  the  transportation  simplex)  are  essentially  the  same 
as  those  in  the  basic  simplex  algorithm*  Only  the  mechanics  by  which  solu¬ 
tion  bases  are  created,  examined  for  optimality,  and  revised  are  different. 

The  transportation  method  has  two  structural  requirements  which  must  be 
observed.  First,  total  supply  must  equal  total  demand  (effectively,  this 
is  the  same  as  converting  inequalities  to  equalities  by  adding  slack  or 
surplus  variables) .  Secondly,  the  solution  basis  will  never  contain  more 
than  (m  +  n  -  1)  basic  variables,  where  m  equals  the  number  of  supply 
points  or  origins  and  n  equals  the  number  of  demand  points  or  destinations. 
This  is  due  to  the  inherent  redundancy  in  constraints  created  by  the 
requirement  that  total  supply  equal  total  demand.  When  there  are  fewer 
than  (m  +  n  -  1)  basic  variables  in  solution  (not  equal  to  zero),  the  solu¬ 
tion  is  degenerate. 

The  first  step  in  the  solution  procedure  is  to  convert  the  problem  to 
a  standard  tabular  form  in  which  supply  equals  demand  and  the  objective  is  — 
one  of  minimization.  The  generalized  tabular  form  for  the  transportation 
model  is  illustrated  in  Table  16-37. 


1 


To  illustrate  the  procedure/  consider  the  following  mathematical  (LP)  model: 
minimize:  Z  -  4Xn  +  8X12  +  8X13  +  16*21  +  24X22  +  16X23  +  8X31  +  16X32+  24X33 


3»t* 

X11  +  X12 

< 

76 

supply  constraints: 

X21  +  X22  +  X23 

< 

82 

X31  +  x32  +  x33 

< 

77 

S  -  235 

X11  +  X21  +  X31 

* 

72 

x12  +  x22  +  x32 

3 

102 

x23  +  x33 

s 

41 

D  *  215 

Xij  I  0  (i,j  »  1,2,3) 

Notice  that  total  supply  (235)  exceeds  total  demand  (215'.  To  solve  this 
problem  by  the  basic  simplex  algorithm,  recall  that  it  would  be  necessary 
to  convert  the  supply  inequalities  to  equalities  by  introducing  a  slack 
variable  in  each  of  the  (<)  constraints.  It  is  necessary  to  take  an 
equivalent  step  in  the  transportation  (simplex)  algorithm.  Since,  in  this 
example,  supply  exceeds  demand  by  20  units,  we  need  to  create  a  "dummy" 
demand  point  or  destination  having  a  demand  for  20  units.  Had  demand 
exceeded  the  supply,  it  would  have  been  necessary  to  introduce  a  dummy 
supply  point.  The  initial  transportation  table  is  illustrated  in  Table  16-38. 
Note  that  C^j  *  0  for  all  dummy  variables  introduced  to  "balance"  the  table. 

It  should  also  be  noted  that  if  a  particular  link  (X^j)  is  not  feasible  (in 
this  example,  *^3),  *  +M.  This  will  insure  X^j  is  driven  out  of  the 

solution  basis. 


Table  16-38 


i 

1 


1 
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c 


Once  the  problem  is  converted  to  standard  tabular  form,  the  next  step 
is  to  identify  an  initial  basic  feasible  solution.  There  are  a  number  of 
procedures  for  obtaining  an  initial  solution.  The  Northwest  Corner  Rule 
is  a  very  simple,  though  not  necessarily  efficient,  method.  You  begin  in 
the  upper-left  ("northwest")  corner  and  allocate  as  much  as  possible.  In  this 
case,  72  units  can  be  placed  in  cell  (1,1).  This  allocation  uses  up  all  of 
the  demand  in  column  (1),  but  leaves  76  -  72  ■  4  units  in  row  (1).  These 
remaining  units  are  allocated  to  (1,2).  Allocating  these  4  units  to  (1,2) 
leaves  98  units  in  column  (2).  However,  of  these,  only  82  can  be  committed 
to  (2,2)  because  the  supply  for  the  second  row  is  limited  to  that  amount. 
Allocating  82  units  to  (2,2)  leaves  16  units  in  column  (2).  These  can  be 
assigned  to  (3,2)  satisfying  the  demand  of  column  (2)  and  reducing  the 
supply  in  row  (3)  from  77  to  61.  Because  of  the  demand  in  column  (3),  only 
41  units  can  be  allocated  to  (3,3),  satisfying  the  demand  in  that  column 
and  leaving  20  units  for  assignment  to  cell  (3,4).  Because  the  table  was 
constructed  to  insure  that  total  supply  (S)  and  total  demand  (D)  are  equal, 
allocating  the  remaining  20  units  to  (3,4)  closes  out  both  row  ( 3 )  and 
column  ( 4 ) . 


The  initial  basic  feasible  solution  then  is: 


*11 

m 

72 

Z  -  4(72) 

a 

288 

*12 

a 

4 

+  8(4) 

+ 

32 

x22 

m 

82 

+  24(82) 

+ 

1968 

x32 

m 

16 

+  16(16) 

+ 

256 

x33 

a 

41 

+  24(41) 

+ 

984 

x34 

a 

20 

+  0(20) 

+ 

0 

$  3528 


16-59 


A  second  procedure  for  determining  an  initial  basic  feasible  solution 
is  called  the  Least  Cost  Rule.  In  this  method,  we  try  to  be  "greedy"  and 
sequentially  allocate  as  much  as  possible  to  cells  with  the  smallest  Cj_  j 
value,  thereby  hoping  to  reduce  the  cost.  Table  16-40  illustrates  the  initial 
basic  feasible  solution  using  the  Least  Cost  method.  Begin  by  finding  the 
cell  (i,  j)  with  the  smallest  j .  In  this  example,  cells  (1,  4),  (2,  4), 
and  (3,  4)  each  have  a  C^j  =  0.  JYou  can  begin  by  selecting  one  of  these 
cells  arbitrarily,  or  you  can  perhaps  do  a  little  better  by  looking  at  each 
option  to  determine  which  leads  to  the  next  smallest  C^j  value.  In  this 
case,  allocating  20  units  to  (1,  4)  permits  you  to  assign  the  remaining  56 
units  to  cell  (1,  1),  which  has  a  unit  cost  of  =  4.  Since  the  next 
"cheapest"  cell  is  (3,  1)  with  C^j  =  8,  the  remaining  16  units  in  column 
(1)  are  assigned,  leaving  a  total  of  61  in  row  (3).  There  is  now  a  tie 
between  (2,  3)  and  (3,  2)  for  the  cell  in  which  an  allocation  can  be  made 
that  has  the  next  smallest  C^j  (=16).  Since  an  allocation  in  either  of 
these  positions  forces  an  allocation  to  (2,  2)  where  C22  =  24 ,  you  can  pro¬ 
ceed  more  or  less  arbitrarily.  Assigning  61  to  (3,  2)  uses  up  the  remaining 
supply  in  row  ( 3 )  and  reduces  the  demand  remaining  in  column  ( 2 )  to  41.  This 
residual  must  now  be  assigned  to  (2,  2),  completing  the  solution  basis. 


Table  16-40 


1 

2 

3 

4(D) 

SUPPLY 

4 

8 

H 

0 

-T6- 

1 

6 

) 

16 

24 

16 

0 

-»2- 

2 

© 

© 

-4-1- 

8 

16 

24 

0 

3 

© 

© 

) 

-n- 

~t0  2 

-#t- 

-26- 

DEMAND 

235 

-+6— 

-*4- 

The  initial  basic  feasible  solution  by  the  Least  Cost  method  is: 


X11 

= 

56 

=  4(56) 

224 

x14 

= 

20 

+  0(20) 

+ 

0 

x22 

a 

41 

24(41 ) 

+ 

984 

x23 

= 

41 

16(41  ) 

+ 

656 

X31 

a 

16 

8(16) 

+ 

128 

x32 

=* 

61 

16(61) 

+ 

976 

$  2,968 


While  the  Least  Cost  Rule  is  still  rather  conceptually  unsophisticated, 
it  does  -  in  this  case  -  yield  an  initial  basic  feasible  solution  that  is 
$3,528  -  $2,968  *  $560  better  than  that  obtained  using  the  "unthinking" 
Northwest  Corner  Rule. 

The  difficulty  with  the  Least  Cost  decision  rule  is  that  the  sequential 
selection  of  the  cell  with  the  smallest  Cij  value  may  prevent,  in  a  sub¬ 
sequent  assignment,  allocation  to  a  cell  with  a  lower  C^j  value  than  the 
one  selected.  A  procedure  referred  to  as  Vogel's  Approximation  Method  (VAM) 
attempts  to  mitigate  this  problem  by  first  computing  the  opportunity  cost 
or  penalty  for  a  particular  allocation.  The  procedure  is  as  follows: 


1.  For  each  row  and  column,  find  the  difference  between  the  two 
lowest  CiJ  values.  This  is  the  "Vogel  number." 

2.  Select  the  largest  Vogel  number  and  make  the  largest  allocation 
possible  to  the  cell  (i,  j)  corresponding  to  the  smaller  of  the  two  C^j 
values.  Ties  can  be  broken  arbitrarily • 

3.  After  each  assignment  the  procedure  is  repeated,  using  only  the 
remaining  rows  and  columns^, 

For  the  example,  the  Vogel  computations  are  as  follows: 


ITERATION 

1 


ROW 

1  4<CU)  -  0(^4)  *  4 

2  16(C2i/23>  -  °<c24>  “ 

3  16(C32)  -  8(C31)  =  8 
COLUMN 

1  8(C31)  -4<CU)  -  4 

2  16(C32)-  8<C12)  -  8 

3  24(C33)  -  16(C23)  -  8 

4  <=14/24/34 }  ~  0(C14/24/34)  "  0 


Assign  20  to  (2,  4) 

ITERATION  ROW 

2  1 

2 

3 

COLUMN 

1 

2 

3 


8(C12)  -  4(CU)  -  4 
16(C2i/23>  -  16(C2^/23)  ■  0 
16(C32)  -  8(C31)  -  8  W'(TIE) 

8<C31)  -  4(Cn)  -  4 

16(C32)  -  8(C12)  -  8 

24<C33)  -  16(C23)  -  8 
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Arbitrarily  select  (3,  1)  and  assign  72 


ITERATION 

3 


ROW 

1  M(C13)  -  8(C12)  -  Mn/ 

2  24(C22)  “  16(C23)  -  8 

3  24(033)  "  16(C32)  -  8 
COLUMN 

2  16(C32)  -  8(C12)  -  8 

3  24(C33)  -  16(C23)  -  8 


Assign  76  to  (1,  2) 


ITERATION 

4 


ROW 

2 

24(C22)  -  16 (C23 ) 

=  8  V 

3 

24(C33)  -  16(C32 ) 

*  8 

COLUMN 

2 

24(C22)  -  16(C32) 

-  8 

3 

24(C33)  -  16(C23) 

=  8 

Arbitrarily  select  (2,  3)  and  assign  41 


ITERATION  COLUMN 

5  2  24(C22)  -  16(C32)  -  8 

Assign  5  to  (3,  2),  forcing  assignment  of  21  to  (22) 


Table  16-41 


1 

2 

3 

4(D) 

SUPPLY 

4 

8 

m 

0 

1 

m 

24 

■ 

0 

2 

m 

-62— 

-24- 

1 

8 

16 

■ 

m 

a 

-42- 

3 

© 

... 

1 02 

DEMAND 

-26- 

-26- 

235 

_ 

• 

_ 
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The  initial  basic  feasible  solution  using  the  VAM  is: 


8(76)  -  608 

24(21)  -  504 

16(41)  -  656 

0(20)  -  0 

8(72)  -  576 

16(5)  -  80 

$  2,424 


Notice  that  this  solution  is  $2,968  -  2,424  ”  $544  better  than  that 
obtained  using  the  Least  Cost  Rule  and  $3,528  -  2,424  m  $1,104  better  than 
the  one  using  the  Northwest  Corner  Rule.  When  solving  transportation 
problems  manually,  VAM  gives  a  good  (often  optimal)  initial  solution  at 
some  computational  expense.  By  contrast,  the  Northwest  Corner  and  Least 
Cost  decision  rules  offer  a  quicker  initial  solution,  but  one  which  is  not 
necessarily  close  to  the  optimal  solution  and  one  which  may  require  (many) 
more  iterations  of  the  transportation  simplex  method  to  reach  optimality. 


Xi2  “ 

76 

Z  - 

x22  “ 

21 

+ 

X23  " 

41 

+ 

X24  * 

20 

+ 

X31  ” 

72 

X32  “ 

5 

Once  an  initial  basic  feasible  solution  by  either  the  Northwest  Corner, 
Least  Cost,  or  VAM  approach  is  determined,  the  next  step  is  to  test  that 
solution  for  optimality.  Two  alternative  procedures  are  commonly  used: 
the  stepping-stone  and  Modified  Distribution  (MODI)  methods. 


The  "steppincr-atona"  procedure  is  very  similar  to  the  basic  simplex 
algorithm.  In  the  simplex,  the  signs  of  the  values  in  row  (0),  which 
represented  marginal  rates  of  contribution  to  the  criterion  (Z)  for  the 
nonbasic  variables,  were  checked  to  determine  whether  or  not  the  solution 
was  optimal.  In  the  stepping-stone  procedure  we  do  the  same  thing.  An 
index  representing  the  marginal  contribution  to  the  criterion  variable 
value  is  computed  for  each  nonbasic  variable  (i.e.,  for  each  empty  cell). 

If  all  of  these  indicators  are  positive,  no  further  improvement  (reduction) 
in  Z  is  possible.  On  the  other  hand,  if  one  or  more  cell  has  a  negative 
indicator,  the  current  solution  basis  is  not  optimal.  Consider  again  the 
initial  basic  feasible  solution  found  by  the  Northwest  Corner  Rule  and 
reproduced  in  Table  16-42.  To  determine  if  this  solution  is  optimal,  we  need 
to  examine  each  nonbasic  variable  (empty  cell)  and  ask  what  the  effect  on  Z 
is  if  one  unit  of  this  (nonbasic)  variable  enters  the  solution  basis. 

Look,  for  example,  at  variable  JL.  .  If  one  unit  enters  (2,  1),  then  cell 
(2,  2)  must  be  reduced  by  one  unit  to  keep  the  total  at  82.  However,  if 
( 2 ,  2 )  is  reduced  by  one ,  then  ( 1 ,  2 )  must  be  increased  by  one  to  keep  the 

column  in  balance  at  102.  Similarly,  increasing  (1,  2)  by  one  means  (1,  1) 

must  be  reduced  by  one  to  keep  row  (1)  equal  to  76.  Now,  for  this  closed 
loop,  i.e.,  for  +(2,  1),  -  (2,  2),  +  (1,  2),  -  (1,  1),  look  at  the 
corresponding  C^j  values:  +16  -  24  +  8  -  4  -  -4.  The  cell  indicator  for 

nonbasic  variable  (2,  1)  is  -4,  meaning  that  for  every  unit  of  X^  brought 

into  the  solution  basis,  Z  changes  by  -4. 


Table  16-42 


DEMAND 


72 


102 


20 


235 


Increasing  ( 2 ,  4)  requires  a  corresponding  decrease  in  (3,  4).  Con¬ 
tinuing,  the  decrease  in  (3,  4)  requires  an  increase  in  (3,  2).  Finally,  the 
increase  in  (3,  2)  forces  a  similar  decrease  in  (2,  2).  Looking  at  the  Cj.j 
values  for  this  closed  loop:  +  0  -  0  +  16  -  24  ■  -8.  The  computations  for 
each  of  the  cell  indicators  are  shown  below: 


HONBASIC  VARIABLE 

(1,  3)  +  K-24+16-8-+M-16  (+) 

(1,  4)  +0-0+16  -8*  +8 

(2,  1)  +  16  -  24  +  8  -  4  -  -  4 

(2,  3)  -+  16  -  24  +  16  -  24  -  -16 

(2,  4)  +  0  -  0  +  16  -  24  -  -  8 

(3,  1)  +  8  -  16  +  8  -  4  -  -4 

Since  there  are  four  negative  indicators,  it  is  apparent  that  this  initial 
solution  is  not  optimal  and  can  be  improved* 


In  the  simplex  procedure,  we  saw  that  a  nonoptimal  solution  was 
improved  by  bringing  the  nonbasic  variable  into  the  basis,  thereby  driving 
one  of  the  basic  variables  out.  The  incoming  variable  was  that  making  the 
largest  contribution  to  the  objective*  The  same  is  true  in  the  stepping- 
stone  algorithm.  The  nonbasic  variable  having  the  most  negative  cell  indi¬ 
cator  is  selected  as  the  entering  variable.  In  this  example,  X23  ®*ke*  the 
greatest  marginal  reduction  in  Z(-16).  In  determining  this  cell  indicator, 
we  found  that  as  X23  was  increased,  X33  decreased,  X32  increased,  and  X22 
decreased.  The  question  of  importance  is  how  much  X23  can  be  increased 
before  one  of  the  basic  variables  is  driven  out  of  the  basis.  In  the 


general  simplex  algorithm,  this  was  determined  by  calculating  the  ratio 


in  the  stepping-stone  algorithm,  we  focus  on  the  basic  variables  in 
the  closed  loop  used  to  determine  the  cell  indicator  for  the  incoming 


variable : 


As  (2,  3)  increases,  both  (3,  3)  and  (2,  2)  decrease.  When  has 
increased  to  41,  X33  will  have  decreased  to  0,  X32  will  have  increased  to 
57,  and  X22  will  have  decreased  to  41.  The  leaving  variable  then  is  the 


basic  variable  in  the  closed  loop  which  has  the  smallest  value  and  which 
decreases  as  the  incoming  variable  increases  in  value.  This  new  solution 
is  illustrated  in  Table  16-44. 


DEMAND 


2 

1 

0 

8 

© 

24 

© 

16 

102 

Checking  this  for  optimality,  we  get: 

NONBASIC  VARIABLE 

(1.3)  +  M  -  16  +  24  -  8  -  N 

(1.4)  +0-0+16-8  -  +8 

(2,  1)  +16-24+8-4  -  -4 

(2,  4)  +0-0+16-24  -  -8 

(2,  1)  +8-16+8-4  -  -4 

(3,  3)  +24-16+24-16  -  +16 

(2,  4)  enters;  (3,  4)  leaves;  the  new  basis  is: 

X,,  -  72 


*23  “ 


Table  16-45  reflects  this  new  solution.  Check  for  optimality 


NONBASIC  VARIABLE 


(1- 

,3) 

+M  - 

16  +  24-8  - 

M 

(1. 

?  4 ) 

+0  - 

0  + 

24  - 

8 

16 

(2, 

,1) 

+  16 

-  24 

+  8-4  - 

-4 

(3, 

-1) 

+8  - 

16  +  8  - 

4 

-4 

(3, 

,3) 

+24 

-  16 

+  24 

-  16  - 

16 

(3, 

p4) 

+0  - 

0  + 

24  - 

16  - 

+8 

Tie  between  (2,1)  and  (3,1)  for  the  entering  variable;  select  (3,1)  because 
it  permits  Z  to  be  reduced  by  -4(72)  vs.  -4(21)  if  (2,1)  is  selected.  As 
(3,1)  enters ,  (1,1)  leaves  and  the  new  solution  is : 


Table  16-46 


Table  16-46  reflects  this  new  solution* 
Testing  for  optimality: 

NONBASIC  VARIABLE 


(1, 

,1) 

4  - 

8  + 

16  -  i 

8 

m 

+4 

(1. 

,3) 

M  - 

16  + 

24  - 

8 

m 

M 

<1, 

.4) 

+0  -  0  + 

24  - 

8 

- 

+  16 

(2, 

,1) 

+16 

-  24 

+  16 

- 

8-0 

(3, 

,3) 

+24 

-  16 

+  24 

- 

16 

-  +16 

(3, 

,4) 

+0  -  0  + 

24  - 

16  - 

+8 

The  solution  in  Table  16-46  is  optimal: 

Xij 

X12  -  76  Z  -  8(76)  -  608 

X22  “  21  +24(21)  -  504 

X23  -  41  +16(41)  -  656 

X24  -20  +  0(20)  -  0 

X31  -  72  +  8(72)  -  576 

X32  -  5  +16(5)  -  80 

$2,424 


Several  points  should  be  noted  about  this  optimal  solution.  First,  note 
that  bringing  (2,1)  into  the  solution  has  no  effect  on  Z.  Consequently,  an 
alternate  optimum  solution  exists: 

X12  -  76  Z  -  8(76)  -  608 
X21  ■  21  +16(21)  -  +336 
x23  *  41  +16(41)  -  +656 
X24  -  20  +  0(20)  -  0 
X31  -  51  +  8(51)  -  +408 
X32  -  26  +16(26)  -  +416 

$2,424 


Secondly,  the  optimal  solution  arrived  at  in  Table  16-46  after  several 
iterations  is  the  same  solution  previously  generated  by  the  VAM  procedure. 
This  illustrates  the  relative  efficiency  of  the  VAM  for  finding  an  initial 
basic  feasible  (and,  in  this  case,  optimal)  solution. 

The  MODI  solution  algorithm  is  a  variation  of  the  stepping-stone  proce¬ 
dure  that  uses  a  somewhat  more  sophisticated  technique  for  computing  the 
indices  used  to  test  the  solution  for  optimality  and  to  improve  nonoptimal 
solutions.  The  procedure  is  based  on  the  following  two  relationships: 


cij  *  +  Vj  (For  basic  variables) 

rij  "  Cij  -  Ui  -  Vj  (For  nonbasic  variables) 

where  Ij.j  represents  the  change  in  the  value  of  the  criterion  variable  (Z) 
which  would  result  from  introducing  one  unit  of  the  nonbasic  variable  X^j 
into  the  solution  basis.  These  relationships  are  derived  from  what  is 
known  as  the  dual  formulation  of  the  LP  model.  A  discussion  of  duality 
theory  is  not  included  here,  but  can  be  found  in  the  texts  referenced 
following  this  section.  To  illustrate  the  MODI  procedure,  the  example 
moduli  has  been  reproduced  in  Table  16-47.  The  initial  basic  feasible  solu¬ 
tion  shown  here  is  the  one  derived  using  the  Least  Cost  method  (see  Table 
16-40).  Notice  that  the  table  has  been  modified  to  include  a  column 
labeled  "0^"  and  a  row  labeled  ”VjM. 

We  begin  by  arbitrarily  setting  the  value  of  any  or  Vj  value  to  zero. 
For  example,  let  ■  0.  Now,  recall  that  for  each  basic  variable  (i.e., 
for  each  "filled"  cell): 


Cij  -  +  Vj 

With  this  relationship,  and  setting  one  of  the  Uj^  or  Vj  values  equal  to 
zero,  all  of  the  remaining  and  Vj  values  can  be  computed: 


1 


2 


3 


4(D) 


SUPPLY 


Solution  is  not  optimal;  there  is  a  tie  between  X21  and  X 3^  as  the 
entering  variable.  Select  Xg^  to  enter.  As  Xj^  enters,  X^i  leaves  and  the 
new  solution  basis  is:  Xj^  =  76,  X22  “  21,  X23  *  41,  X24  *  20,  X^  -  72, 
X32  -  5.  This  solution  is  shown  in  Table  16-51 


Table  16-51 


■ 

1 

_ 

2 

3 

4(D) 

Supply 

Ui 

1 

4 

© 

8 

M 

0 

76 

. 

0 

2 

16 

© 

24 

© 

16 

© 

0 

82 

16 

3 

© 

8 

© 

16 

24 

0 

77 

8 

DEMAND 

72 

102 

4*1 

20 

235 

0 

8 

0 

-16 

Again,  checking  for  optimality,  compute  Uj_  and  Vj  values: 
Ui  -  0 

v2  *  C12  -01-8-0-8 
02  3  C22  _  V2  -  24  -  8  -  16 

V3  =*  C23  -  U2  —  16  -  16  *  0 

V4  =*  C24  -  02  -  0  -  16  -  -16 

U3  -  C32  -  V2  -  16  -  8  -  8 

V4  -  C31  -  U3  -  8  -  8  -  0 


Computing  the  indices  for  the  nonbasic  variables: 


*11 

"  C11 

-  ux 

*13 

*  C13 

-  ux 

x14 

“  c14 

-  u1 

*21 

“  C2X 

-  u2 

*33 

“  c33 

-  u3 

*34 

“  c34 

-  U3 

Vx— 4-0-0-+4 
V3-M-O-O-+M 
V4  -  0  -  0  -(-16)  -  +16 
Vx  -  16  -  16  -  0  -  0 
V3  -  24  -  8  -  0  -  +16 
V4  -  0  -  8  -(-16)  -  +8 
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The  solution  of  Table  16-51  is  optimal 


Z  -  8(76)  +  24(21)  +  16(41)  +  0(20)  +  8(72)  +  16(5) 
Z  =  608  +  504  +  656  +  0  +  576  +  80 
Z  -  $2,424 


In  the  process  of  developing  the  initial  or  a  subsequent  basic  feasible 
solution,  it  is  not  uncommon  to  encounter  the  abberation  of  degeneracy.  In 
the  transportation  model,  a  solution  is  degenerate  when  there  are  fewer 
than  (m+n-1)  basic  variables.  When  this  condition  occurs,  it  is  no  longer 
possible  to  compute  the  improvement  indices.  Consider,  for  example,  the 
transportation  model  illustrated  in  Table  16-52. 


Table  16-52 


1 

2 

3 

SUPPLY 

4 

8 

8 

u 

1 

© 

© 

-M- 

16 

24 

16 

2 

© 

■35 

8 

16 

24 

3 

© 

35 

DEMAND 

-55 — 

35 

115 

(m+n-1 ) 

3+3- 1-5 

Only  4  basic 
cells:  initial 
solution  is 
degenerate . 


Using  the  Northwest  Corner  Rule  to  derive  an  initial  basic  feasible  solu¬ 
tion,  we  assign  35  to  (1,1),  closing  out  column  (1)  and  leaving  20  in  row 
( 1 ) .  These  20  are  assigned  to  (1,2),  closing  out  row  ( 1 )  and  leaving  25  in 
column  (2).  However,  when  we  next  allocate  those  remaining  25  units  to 
(2,2),  it  simultaneously  closes  out  both  row  (2)  and  column  (2). 
Consequently,  there  are  only  4  basic  variables  rather  than  the  m(3)+n(3)- 
1-5  required.  The  initial  basic  feasible  solution  is  degenerate.  Suppose 
you  had  not  recognized  this  condition  and  were  attempting  to  compute  impro¬ 
vement  indices  for  the  nonbasic  cells  (using  either  the  stepping-stone  or 
MODI  procedures).  For  cells  (1,3),  (2,3),  (3,1),  and  (3,2),  it  is  not 
possible  to  form  a  closed  loop. 


Fortunately,  this  is  not  a  difficult  condition  to  resolve.  Simply  intro 
duce  a  "0"  allocation  (sometimes  represented  by  "eM)  to  one  of  the  cells 
While  there  is  usually  a  great  deal  of  flexibility  in  selecting  the  cell 
for  the  0  allocation,  the  general  procedure,  when  using  the  Northwest 
Corner  Rule,  is  to  assign  it  to  the  cell  that  maintains  the  "stair-step" 
pattern  as  shown  in  Table  16-53. 


Table  16-53 


The  problem  can  now  be  solved,  treating  the  variable  having  the  e  or  0 
allocation  like  any  other  basic  variable.  The  0  allocation  is  simply  a 
computational  device.  It  has  no  significance  in  the  problem. 

Degeneracy  can  also  surface  after  the  initial  solution  has  been  determined 
Consider  the  partial  transportation  table  shown  below: 


Nonbasic  variable  X2^  h^s  8  negative  cell  indicator*  The  solution  is  not 
optimal .  As  X21  enters,  however ,  both  and  X22  go  to  0  simultaneously 
(a  tie  for  the  leaving  variable). 


As  in  the  case  of  the  degenerate  initial  basic  feasible  solution,  we 
correct  this  situation  by  introducing  an  e  or  0  allocation  into  one  of 
those  cells  simultaneously  driven  out  of  the  basis  and  proceed  with  the 
algorithm,  treating  the  cell  with  the  e  entry  like  any  other  basic  variable 


The  Assignment  Model 


m 


Just  as  the  transportation  model  is  a  special  case  of  the  general  LP 
formulation,  the  assignment  model  is  a  special  case  of  the  transportation 
model  (and,  therefore,  a  special  case  of  the  LP  model)*  The  unique  struc¬ 
ture  of  the  model  permits  solution  by  techniques  which  are  computationally 
more  efficient  than  either  the  transportation  or  simplex  algorithms . 


In  the  assignment  problem,  there  are  m«n  supply  and  demand  points. 
Unlike  the  transportation  problem,  however,  all  of  the  commodity  available 
at  a  supply  point  (i)  must  be  allocated  to  one  of  the  demand  points  (j). 

In  the  prototype  problem,  the  objective  is  to  assign  n  individuals  to  n 
jobs  in  such  a  way  as  to  minimize  cost,  maximize  efficiency,  etc.  Like  the 
transportation  model,  the  assignment  formulation  is  important  and  useful 
because  it  is  applicable  to  a  wide  variety  of  managerial  problems. 


General  System  Analysis 


System  Components: 


1.  Criterion  Variable  (Z) t  Total  cost  to  distribute  all  of  the 
commodity  available  at  each  of  n  supply  points  to  n  demand  points;  the 
objective  is  to  minimize  Z. 


2.  Decision  Variable  (X^ j ) :  X^j  >1  if  the  supply  at  i  is  assigned 
to  demand  point  j  and  0  if  it  is  not  (i, j-1 ,2, . . . ,n) . 

3.  Environmental  Variables: 


**  amount  of  commodity  assigned  from  supply  point  i 

(Si  -  1,  i«1,2,...,n) 

Dj  *  amount  of  commodity  assigned  to  demand  point  j 
(Dj  -  1,  j»1,2,...,n) 

S  “  total  supply  at  all  sources 

D  ”  total  demand  at  all  destinations 
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System  Relationships 


Figure  16-20 


■n 
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Mathematical  Formulation 


n  n 

minimizes  Z  »  J  ][  C,.  X.  . 

i-1  j-1  3  3 


■  •t. 


J  ■  1  (i  *  1*2/ • ■ • »n) 


j-1 

n 

I 

i-1 


*ij 


1  (  j  —  1,2, .. . ,n) 


hi  m 


Note  that  since,  in  the  assignment  model,  m-n  and  S^-d  -1,  total  supply 
equals  total  demand.  3 


Formulation  Examples 

Example  16-2,  is  a  "classical"  assignment  problem  involving  the  assign¬ 
ment  of  people  to  tasks. 


EXAMPLE  16-2 


Three  new  machines  of  different  types  have  been  purchased  for  installa¬ 
tion  and  use  in  one  of  the  shops  at  an  Air  Logistics  Center.  There  are 
four  available  locations  in  the  shop  at  which  one  of  the  machines  could  be 
installed.  However ,  some  of  these  locations  are  more  desirable  than  others 
for  particular  machines  because  of  their  proximity  to  various  work  centers. 
The  objective  is  to  assign  the  machines  to  the  potential  locations  in  such 
a  way  as  to  minimize  the  total  cost  of  materials  handling.  The  estimated 
cost  per  unit  time  of  materials  handling  involving  each  of  the  machines  is 
given  in  Table  16-54.  Note  that  machine  2  cannot  be  assigned  to  location  2 


MACHINE 


Table  16-54 


LOCATION 

1 

2 

3 

4 

1 

10 

15 

20 

12 

2 

14 

X 

12 

8 

3 

6 

18 

10 

15 

How  should  the  machines  be  located? 


1.  Criterion  Variable  (Z):  Total  coat  per  unit  time;  objective  is 
to  minimize  Z. 

2.  Decision  Variable  (X  j_ j)**1  if  machine  i  is  assigned  to  location 
j;  0  otherwise*  (Note:  to  formulate  this  as  an  assignment  model  with  mmn, 
it  is  necessary  to  introduce  a  "dummy"  machine  to  fill  the  extra  location). 

3.  Environmental  Variables :  Si  «  number  of  locations  assigned  to 
machine  i  (Si  -  1;  i-1 ,2, . . . ,n) .  Dj  *  number  of  machines  assigned  to  loca¬ 
tion  j  (Dj  -  1;  j— 1,2, ... ,n) . 
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Mathematical  (LP)  Model 


minimize :  Z  -  10Xn  +  15X12  +  20x13  +  12X14  +  14x21  +  12x23 
+  8X24  +  6X31  +  I8X32  +  IOX33  +  15x34  +  0X41 
+  OX42  +  OX43  +  OX44 


s«t.  Xu  +  X]_2  +  Xj_3  +  Xj_4  —  1 

X21  +  x22  +  x23  +  x24  “  1 

X31  +  x32  +  x33  +  x34  "  1 

X41  +  ^2  +  *43  +  ^4  “  1 


X11  +  X21  +  X31  +  *41  *  1 
x12  +  x22  +  x32  +  ^42  -  1 
x13  +  x23  +  x33  +  *43  ’  1 
x14  +  x24  +  x34  +  ^4  "  1 

XAj  -  0,1  (i,j  -  1,2,.. .,4) 


Solution  Techniques 


As  a  special  case  of  the  transportation  model,  the  assignment  problem 
can  be  solved  with  either  the  transportation  algorithm  or  the  more  general 
simplex  algorithm.  In  addition,  a  more  computationally  efficient  tech¬ 
nique,  known  as  the  Hungarian  Method,  has  been  devised  for  solution  of 
assignment  problems.  Basically,  this  procedure  employs  the  concept  of 
opportunity  oosts  and  can  be  illustrated  with  the  example  presented  in  the 
previous  section: 

The  procedure  begins  by  subtracting  from  each  row  and  column  of  the 
cost  table  the  smallest  value  in  that  row  or  column. 


1. 


1 

2 

3 

4 

1 

10 

15 

20 

12 

2 

14 

M 

12 

8 

3 

6 

18 

10 

15 

4 

0 

0 

0 

0 

Smallest  Row  Value 


10 


8 

6 

0-24 
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Subtracting  these  minimum  row  values: 


i 

2 

_ 

3 

4 

1 

0 

10 

2 

m 

6 

4 

0 

B 

0 

12 

4 

9 

i 

0 

0 

0 

Now,  subtracting  the  minimum  column  values  gives  the  same  table  because 
there  is  at  least  one  0  in  each  column* 

In  this  algorithm,  assignments  are  made,  if  possible,  to  cells  with 
"0"  values*  After  step  (1)  above,  there  will  be  at  least  one  "0"  in 
each  row  and  column.  A  convenient  method  exists  for  determining  if  the 
solution  is  optimal.  Attempt  to  "cover"  all  of  the  zero  values  by  drawing 
vertical  and/or  horizontal  straight  lines.  If  the  minimum  number  of  lines 
required  to  cover  all  zeros  equals  n,  the  number  of  rows  and  column,  the 
solution  is  optimal.  If  the  number  of  lines  required  to  cover  all  zeros  is 
less  than  n,  the  optimal  solution  cannot  be  made.  In  this  example,  an 
optimal  solution  is  not  yet  possible  as  all  of  the  zeros  can  be  covered  by 
a  minimum  of  three  lines: 


1 _ 

2 

3 

1 

5 

N 

12 

10 

4 

4 

l 

> 

> 

n 

□ 

n 

To  improve  the  solution,  find  the  smallest  "uncovered”  value  and: 

1.  subtract  it  from  the  other  uncovered  values,  and 

2.  add  it  to  those  values  at  the  intersections  of  two  lines. 
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In  this  example,  4  is  the  smallest  uncovered  value.  Subtracting  it  from 
the  other  uncovered  values  and  adding  it  to  the  intersection  values,  i.e., 
(4,1)  and  (4,4),  gives  the  following  revised  table: 


Row  1,  Col  taut  2,  and  Column  4  hava  only  ona  aero  each.  Therefore,  these 
assignments  must  be  made,  as  indicated  by  the  square  drawn  around  the  0. 
Since  each  row  and  column  can  have  only  one  5C  j  >1 ,  the  optimal  assignment 
is:  J 


*11  - 

1 

C11 

■ 

10 

x24  “ 

1 

C24 

■ 

8 

X33  “ 

1 

C33 

SB 

10 

X42  ” 

1 

C42 

m 

0 

Z 

~ 

~2B 

Note  that  the  sum  of  all  values  subtracted  from  the  table  (24+4)  is  equal 
to  Z. 
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CHAPTER  17 


NETWORK  ANALYSIS 


INTRODUCTION 

In  the  discussion  of  linear  programming,  we  (hopefully)  demonstrated 
the  importance  and  utility  of  visualizing  and  analyzing  a  decision  situa¬ 
tion  as  a  system.  In  describing  a  system,  we  suggested  that  any  system  is 
comprised  of  a  set  of  relevant  dependent  and  independent  variables  linked 
by  relationships.  An  important  system  characteristic,  implied  but  not 
emphasized,  was  that  such  systems  can  be  interpreted  as  a  network  through 
which  some  element  flows.  The  element  flowing  through  a  network  system  may 
be  virtually  anything,  either  physical  or  nonphysical,  e.g.,  people,  money, 
time,  information,  equipment,  vehicles,  social  influence,  etc.  The  concept 
of  network  systems  and  the  study  of  such  systems  (network  analysis)  has 
played  an  important  role  in  such  disciplines  as  electrical  engineering, 
communications,  computer  science,  transportation  engineering,  and  cyber¬ 
netics.  More  recently,  network  systems  and  their  analysis  have  surfaced  as 
an  important  tool  for  managers  in  a  variety  of  organizational  settings. 

Perhaps  most  notably,  network  analysis  has  emerged  as  a  fundamental  tech¬ 
nique  for  managing  the  timely  and  effective  accomplishment  of  projects, 
particularly  in  the  research  and  development  and  the  construction  areas. 

In  addition,  network  analysis  is  now  being  used  as  a  basic  tool  for 
studying  social,  economic,  and  political  systems. 

In  this  section,  basic  network  concepts  are  introduced.  These  concepts 
are  used  in  a  number  of  algorithms  which  have  been  developed  to  solve  network- 
type  problems  which  frequently  occur  in  a  variety  of  managerial  decision 
situations.  Specifically,  we  will  discuss  the  following  network  models: 

a.  Shortest  Route  Problem 

b.  Minimal  Spanning  Tree  Problem 

c.  Maximal  Flow  Problem 

d.  Closed  Circuit  Problem 

This  section  concludes  with  a  discussion  of  project  (system)  management 
and  the  use  of  such  techniques  as  the  Critical  Path  Method  (CPM)  and  the 
Program  Evaluation  and  Review  Technique  (PERT). 

Basic  Network  Concepts  and  Definitions 

To  facilitate  the  discussion  of  network  systems  and  the  analysis  of 
such  systems,  it  is  necessary  to  introduce  a  number  of  basic  concepts  and 
definitions. 
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1.  BRANCH:  A  line  segment  connecting  two  distinct  end  points  or  stations* 

(a)  NODE:  The  end  point  of  a  branch.  Also  called  a  vertex  or  junc¬ 
tion  point 

(b)  ARC:  The  line  segment  of  a  branch  connecting  the  two  bounding 
nodes.  Also  known  as  edges  or  links. 


NODE 


© 


Figure  17-1.  BRANCH 
ARC 


© 


NODE 


(c)  NODE  INDEX:  Each  node  has  a  unique  label,  i.e.,  a  letter  or 
number.  A  specific  branch  is  identified  by  the  labels  on  its 
nodes  ( i , j  ) . 

(d)  DIRECTED  BRANCH:  A  branch  in  which  the  direction  of  flow  is 
specified.  The  direction  of  flow  is  indicated  with  an  arrow: 

Fiqure  17-2.  DIRECTED  BRANCH 

© - -  -  ►© 

(e)  CAPACITATED  BRANCH:  A  branch  in  which  the  magnitude  of  flow 
is  restricted  to  some  limiting  value. 

2.  CHAIN:  A  sequence  of  branches  connecting  node  i  with  node  j,  such  that 
each  node  in  the  sequence  is  encountered  only  once: 


© 


Figure  17-3.  CHAIN 

© - =0 


© 


3.  PATH:  A  chain  in  which  the  direction  of  flow  i»  specified: 


G> 


Figure  17-4.  PATH 

>0 - =K> 


4.  LOOP:  A  branch  in  which  the  two  nodes  are  coincident: 

Figure  17-5.  LOOP 


5.  CYCLE:  A  chain  in  which  the  two  terminal  nodes  are  coincident: 

Figure  17-6.  CYCLE 


K> 


K> 
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9.  DIRECTED  NETWORK;  A  network  in  which  the  direction  of  flow  is  spe¬ 
cified  for  all  branches: 


10.  SOURCE  NODE:  A  node  in  a  network  from  which  all  flow  moves  away  in 
each  incident  branch.  Also  called  an  ORIGIN  NODE. 

11.  SINK  NODE:  A  node  in  a  network  toward  which  the  flow  moves  in  all 
incident  links  or  branches .  Also  called  a  TEMINAL  NODE  or  DESTINATION  NODE 

12.  INCIDENCE  MATF~X:  A  matrix  that  provides  a  complete  statement  or  sum¬ 
mary  of  the  graph  or  network  structure. 


Figure  17-11.  INCIDENCE  MATRIX 
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If  the  network  is  capacitated  or  the  branches  have  some  specific  values 
associated  with  them,  these  values  can  be  entered  in  the  matrix.  The  inci¬ 
dence  matrix  is  important  because  this  is  the  vehicle  used  to  enter  the 
network  structure  in  a  computer  program. 


13.  OTHER  NETWORK  CONCEPTS; 

1 .  Network  diagrams  are  generally  schematic  only  and  not  drawn  to 
scale . 


2.  If  the  network  is  a  model  of  a  dynamic  system,  you  should  specify 
the  system  state  life,  i.e.,  the  period  over  which  the  network 
structure  holds  or  is  expected  to  hold. 

3.  Values  associated  with  network  branches  (and/or  nodes)  may  be 
either  deterministic  or  stochastic,  depending  on  the  nature  of  the 
system  being  modeled. 


4.  There  are  usually  no  "line  losses"  in  the  system,  i.e.,  there  is 
conservation  of  flow.  The  amount  of  flow  entering  a  node  must 
equal  the  amount  leaving  the  node.  However,  in  multi-period 
models,  flow  may  be  "stored"  for  one  or  more  periods. 


17.1  NETWORK  SYSTEM  MODELS 

We  will  now  look  at  the  formulation  and  analytical  solution  of  four 
rather  "classical"  network  analysis  problems.  In  the  minimal  path  or  short¬ 
est  route  problem  the  objective  is  to  find  the  shortest  (or,  alternatively, 
least  expensive)  route  through  a  network  that  connects  some  source  and 
destination.  The  minimal  spanning  tree  problem  is  a  variation  of  the  shor¬ 
test  route  model.  Here  the  problem  is  to  link  up  a  set  of  nodes  with 
branches  in  such  a  way  as  to  minimize  the  total  length,  cost,  time,  etc. 

In  the  maximal  flow  problem  we  axe  concerned  with  a  capacitated  network  and 
want  to  know  what  maximum  flow  rate  can  be  carried  through  the  network. 

The  final  problem  considered  in  this  section,  the  closed  circuit  or 
"traveling  salesman"  problem,  deals  with  a  network  in  which  we  want  to  know 
the  shortest  route  through  the  network  that  encounters  each  node  once  and 
only  once  and  then  returns  to  the  starting  point.  For  each  model,  the 
discussion  will  include  a  brief  description  of  the  problem  and  one  of  the 
available  solution  algorithms  for  that  particular  formulation.  An  example 
is  also  presented  to  illustrate  application  of  the  solution  algorithm.  It 
should  be  pointed  out  that  each  of  these  problems  can  be  formulated  as  a 
mathematical  model.  However,  these  models  are,  in  some  cases,  rather 
complex  and  beyond  the  scope  of  a  handbook  discussion.  A  number  of 
excellent  references  on  network  analysis  are  available  for  those  interested 
in  a  more  complete  and  detailed  treatment  of  the  subject. 
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MINIMAL  PATH/SHORTEST  ROUTE  PROBLEM 


Problem  Description 

Given  a  (directed  or  undirected)  network  In  which  each  branch  has  some 
assigned  magnitude,  find  the  path  from  some  specified  node  (1)  to  some  other 
specified  node  (j)  having  the  smallest  total  magnitude,  i.e.,  the  minimal  path 
or  shortest  route  between  ( i )  and  ( j ) •  For  example ,  find  the  minimal  path 
between  node  (1)  and  node  (9)  In  Figure  17-12. 

Figure  17-12 


The  numbers  assigned  to  the  network  branches  may  represent  a  variety  of 
system  characteristics,  e.g.,  time,  distance,  cost,  risk  or  other  measures 
of  utility.  If,  for  example,  the  numbers  in  the  above  network  represent 
cost,  we  would  be  interested  in  finding  the  least  expensive  route  between  two 
specified  nodes,  e.g.,  node  (1)  and  node  (9).  Note  that  there  is  no  require¬ 
ment  for  the  "flow"  between  two  nodes  to  be  the  same,  i.e.,  C^j  ¥  Cj^. 

Solution  Techniques 

A  number  of  general  and  specialized  techniques  have  been  developed  to 
solve  minimal  path  problems,  e.g.: 

1 .  mathematical  programming  techniques : 


(a)  linear  programming 


(b)  integer  programming 

( c )  dynamic  programming 


2.  specialized  network  algorithms: 


(a)  graphical  method 

( b)  labeling  method 

(c)  matrix  method 

Mathematical  programming  approaches  will  handle  reasonably  large 
problems  but  don' t,  in  general,  take  advantage  of  the  special  mathematical 
structure  of  the  problem.  The  graphical  algorithms  are  easy  to  apply  but 
are  practically  limited  to  small  networks.  The  labeling  and  matrix  methods 
can  handle  relatively  large  networks  in  a  more  efficient  manner  than  can 
the  mathematical  programming  techniques*  In  addition,  these  two  spe¬ 
cialized  algorithms  can  be  readily  programmed  on  the  computer. 

EXAMPLE  17-1 

Labeling  Method  For  Solution  Of  The  Minimal  Path  Problem 

1 •  Construct  the  network  diagram,  labeling  all  nodes  and  arcs  as  shown 
in  Figure  17-12. 

2.  Construct  a  table  having  one  column  for  each  node: 


Table  17-1 


3.  In  the  column  under  each  node  index,  list  all  branches  leading  out 
of  that  node,  together  with  the  corresponding  branch  value,  i.e.,  distance, 
time,  cost,  etc.  Branches  leading  into  the  origin  or  away  from  the  desti¬ 
nation  need  not  be  listed.  The  branches  and  their  associated  values  should 


1 


be  listed  in  order  of  increasing  branch  value,  i.e.,  the  smallest  value 
first,  etc*: 


Table  17-2 


1 

t  . 

i 

2 

3 

4 

5 

6 

7 

8 

9 

! 

' 

1,2-2 

2,5-3 

3,4-1 

4,3-1 

5,2-1 

6,5-1 

7,3-1 

8,4-3 

i 

1,3-3 

3,7-1 

4,8-3 

5,6-1 

6,3-2 

D 

8,7-4 

t 

1,4-5 

3,6-2 

B 

5,9-2 

6,7-3 

B 

8,9-6 

3,5-5 

1 

5,3-5 

6,9-4 

■ 

1  V 

i, 


4.  "Label"  the  origin  node  (1)  by  entering  a  "0"  above  the  node  index 
( column  heading) .  This  indicates  the  minimal  distance  from  the  origin  to 
this  node.  Since  this  node  is  the  origin,  the  minimal  distance  must  be  0. 
At  this  point  then,  only  node  1  is  labeled;  all  other  nodes  are  unlabeled: 


Table  17-3 


0 

1 

2 

3 

n 

5 

6 

7 

8 

9 

5.  For  each  unlabeled  node  that  is  connected  directly  to  the  labeled 
node  ( at  this  point ,  nodes  2 ,  3 ,  and  4 ) ,  compute  the  minimal  path  from  the 
origin  by  adding  the  branch  value  to  the  minimal  path  value  previously 
found  for  the  labeled  node  (0  for  the  origin): 


(1,2): 

0 

+ 

2 

■  2  (minimal) 

(1,3) 

0 

+ 

3 

-  3 

(1,4) 

0 

+ 

5 

-  5 
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1 

I 


4 


i 


6.  Select  the  unlabeled  node  that  has  the  smallest  total  path  value, 
i.e.,  the  unlabeled  node  that  is  closest  to  the  origin.  Note  that  if  the 
above  procedures  have  been  followed  correctly,  this  will  be  the  first  entry 
in  the  origin  column.  As  determined  in  the  previous  step,  for  this  case, 
node  2  is  closest  to  the  origin. 

7.  Label  this  closest  node  (2)  with  the  minimal  path  value  found  in 
step  5  (i.e.,  2).  Now  nodes  1  and  2  are  labeled. 


Table  17-4 


0 

2 

1 

2 

3 

4 

5 

6 

7 

8 

9 

8.  In  the  column  corresponding  to  the  origin,  circle  the  branch  to  the 
selected  node  (i.e.,  "1,2-2").  This  indicates  that  the  minimal  path  to 
node  2  from  the  origin  node  is  2  units  and  that  the  route  is  branch  (1,2). 

9.  Since  the  minimal  path  to  node  2  has  now  been  found  (recognized  by 
the  fact  that  node  2  has  been  labeled) ,  all  other  branches  leading  into 
this  node  may  be  eliminated  from  further  consideration.  Consequently, 
cross  out  all  such  branches  in  the  table  (i.e.,  all  branches  (i,j)  in  which 
j*>2).  In  this  case,  branch  (5,2)  is  crossed  out: 


Table  17-5 


9 


10.  For  each  labeled  node  (now  1  and  2),  add  the  minimal  path  value 
for  that  node  to  the  smallest  branch  value  in  that  column.  The  node 
corresponding  to  the  smallest  of  these  sums  is  the  next  node  to  be  labeled 
as  nearest  the  origin.  If  two  or  more  nodes  are  tied  for  the  smallest  sum. 
then  all  such  nodes  are  labeled: 


(1.3): 

(2,5) 


0  +  3 
2  +  3 


(minimum) 


11.  Label  the  selected  node  (node  3)  with  the  minimal  path  value  com¬ 
puted  in  the  previous  step  (i.e.,  3).  Circle  the  selected  branch  (1,3)  and 
cross  out  all  other  branches  leading  into  the  selected  node: 


Table  17-6 


1,4-5 


2 

3 

2 

3 

2,5-3 

3,4-1 

3,7-1 

3,6-2 

3,5-5 

6,5-1  8,4-3 

5,6-1  6>^2  7,6-3  8,7-4 


5,9-2  |  6,7-3  7,4-5  8,9-6 

6,9-4  7,8-5 


12. 

Repeat  steps  10 

and  11 

until 

the 

terminal  node  (in  this  case  node 

9)  has 

been  labeled: 

(1.4): 

0 

+ 

5 

«  m 

5 

(2,5): 

2 

+ 

3 

- 

5 

(3,4): 

3 

+ 

1 

m 

4 

( Tie :  Label  both  4  and  7 ) 

(3,7): 

3 

+ 

1 

“ 

4 

Circle 

(3,4)  and  (3,7); 

cross  out  all 

lit 

4) 

and 

(i,7)  branches. 

Circle  (2,5)  and  (3,6);  cross  out  all  (1,5)  and  (i,6)  branches 
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When  node  9  has  b««n  labs  led,  we  have  found  the  path  frost  nods  1  to 

nods  9.  To  tracs  the  path,  work  "backwards1*  through  the  circled  nodes  in 
the  table,  i.e. ,  9-5-2- 1 .  Therefore,  the  ainiaal  path  value  of  7  units  Is 
found  along  path  1-2-5-9.  The  completed  table  is  as  follows! 


Table  17-7 


MINIMAL  SPANNING  TREE  PROBLEM 


Problem  Description 

The  minimal  panning  t xee  problem  Is  a  variation  of  tha  minimal 
path/shortast  routa  problem.  Given  a  (directed  or  undirected)  network  in 
which  each  branch  has  some  assigned  magnitude  or  value,  find  the  tree 
having  the  minimal  total  value,  i.a. ,  select  the  branches  having  the 
smallest  total  value  while  providing  a  direct  or  indirect  route  between 
each  pair  of  nodes. 

Solution  Technique 

1.  Draw  the  network  diagram,  labeling  all  nodes  and  arcs. 

2.  Arbitrarily  select  any  node  and  connect  it  to  the  nearest  distinct 
node* 

3.  Identify  the  unconnected  node  that  is  nearest  to  a  connected  node 
and  then  connect  these  two  nodes.  Repeat  this  step  until  all  nodes  have 
been  connected. 

EXAMPLE  17-2: 


Figure  17-13 


2.  A  is  the  unconnected  node  closest  to  either  node  B  or  C.  Connect 
A  to  B.  Now  A,  B,  and  C  are  connected: 


3*  0  is  now  the  unconnected  node  nearest  to  A,  B,  or  C.  Connect 

0  to  Am  Now  0,  A ,  B,  and  C  are  connected: 


4.  B  Is  the  unconnected  node  nearest  to  0,  A,  B,  or  C.  Connect  S 
to  B.  Now  nodes  0,  A,  B,  C,  and  B  are  connected: 

r\ 


Figure  17-17 


5.  D  is  now  the  unconnected  nods  nearest  to  0,  A,  B,  C,  or  E.  Connect 
to  E.  Nodes  0»  A,  B,  C,  0/  and  E  are  now  oonnected: 


The  number  on  a  branch  ( i ,  j)  nearest  node  (i)  represents  the  capacity 
of  the  branch  in  the  direction  from  (i)  to  (j).  For  example ,  on  branch  0-A, 
there  is  a  flow  capacity  of  5  units  in  the  direction  from  0  to  A  and  a  capac¬ 
ity  of  0  in  the  direction  from  A  to  0.  The  objective  is  to  determine  the 
total  amount  of  flow  that  can  be  carried  by  the  network  from  node  0  to  node  T. 

Stated  more  formally:  Consider  a  connected  network  having  a  single 
SOURCE  and  a  single  SINK.  Assume  CONSERVATION  of  FLOW  at  each  node  other 
than  the  source  and  sink.  Suppose  that  the  rate  of  flow  along  branch  ( 1, 
j )  from  node  jL  to  node  J,  can  be  any  nonnegative  quantity  not  exceeding  the 
specified  FLOW  CAPACITY  (Cj_j ) .  The  objective  is  to  determine  the  feasible 
steady-state  .pattern  of  flows  through  the  network  that  maximizes  the  total 
flow  from  the  source  too  the  sink. 

Solution  Techniques 

The  maximal  flow  problem  can  be  formulated  as  a  linear  programing  problem 
and  solved  by  the  simplex  method  or  by  any  of  the  available  linear  programming 
computer  codes,  e.g.,  LINPRO  or  UBELP.  An  even  more  efficient  solution 
procedure  is  available  and  is  sumarized  as  follows: 


Step  1 :  Find  a  path  from  source  to  sink  with  strictly  positive 
flow  capacity.  (If  none  exists,  the  net  flows  already  assigned  constitute 
an  optimal  flow  pattern) . 

Step  2:  Search  this  path  for  the  branch  with  the  smallest 
remaining  flow  capacity  (denote  this  by  C*) ,  and  incrsase  the  flow  in  this 
path  by  C*. 
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Step  3:  Decrease  by  C*  the  regaining  flow  capacity  of  each  branch 
in  the  path.  Increase  by  C+  the  remaining  flow  capacity  in  the  opposite 
direction  for  each  branch  in  the  path#  Return  to  Step  1 . 


EXAMBU 


Figure  17-21 


ITERATION  1:  Arbitrarily  select  a  path  through  the  network:  0-B-D-T  at  a 
flow  of  C*"4.  Adjust  the  remaining  flow  capacity: 


OV  A 


Figure  17-22 


T  >  -A> 


ITERATION  2:  Select  path  O-C-E-T  9  4  units  of  flow: 

Figure  17-23 


CLOSED  CIRCUIT/TRAVELING  SALESMAN  PROBLEM 


Problem  Description 

In  the  closed  circuit  or  traveling  salesman  problem,  the  object  is  to 
determine  the  minimal  circuit  through  a  network  that  encounters  each  node 
once  and  only  once  and  returns  to  the  starting  node. 

As  in  the  case  of  the  minimal  path  problem,  the  values  associated  with 
each  branch  may  represent  time,  distance,  cost,  risk,  or  other  measure  of 
utility. 

The  problem  has  become  well-known  and  has  been  studied  for  years .  This 
attention  is  because  the  problem  is  easy  to  state  but  computationally  diffi¬ 
cult  to  solve.  In  addition,  much  of  the  attention  given  this  problem  is  due 
to  the  wide  range  of  problem  situations  that  can  be  modeled  by  this  formula¬ 
tions 

1 .  police  patrol 

2 .  refuse  pick-up 

3.  preventive  maintenance  routes 

4.  staff  assistance  visits 

Obviously,  a  solution  exists  for  the  problem  and  the  minimal  circuit 
could  be  determined  by  explicit  and/or  implicit  enumeration.  However, 
considering  the  combinatorial  nature  of  the  problem,  the  difficulty  arises 
when  you  realize  that  in  a  network  of  n  nodes,  there  are  (n-1)!  feasible 
routes.  A  network  of  only  10  nodes,  for  example,  generates  9!  or  362,880  ' 
routes. 

Solution  Techniques 

A  number  of  solution  approaches  have  been  recommended  including  dynamic 
programming  and  0-1  integer  programming. 

The  integer  programming  formulations  are  commonly  solved  with  various 
" branch-and- bound"  algorithms.  The  technique  basically  partitions  the 
problem  into  successively  smaller  and  smaller  problems,  eliminating  routes 
from  further  consideration  on  the  basis  of  implicit  enumeration  and  evalua¬ 
tion.  The  partitioning  process  continues  until  a  single  optimal  route 
remains . 


EXAMPLE  17-4 

An  inspector  wishes  to  visit  each  of  six  project  sites  and  return  to 
his  home  office.  The  following  table  summarizes  the  distances  of  the  most 


direct  routes  between  each  pair  of  project  locations 

Table  17-8 
TO  SITE:  (1)  : 


1 

2 

3 

4 

5 

6 

1 

M 

27 

43 

16 

30 

26 

FROM  SITE 

2 

7 

M 

16 

1 

30 

25 

3 

20 

13 

M 

35 

5 

0 

(i) 

4 

21 

16 

25 

M 

18 

18 

5 

12 

46 

27 

48 

M 

5 

6 

23 

5 

5 

9 

5 

M 

(Mote:  The  "M"  is  a  very  large  number  introduced  to  model  the  condition 
that  the  inspector  must  move  from  site  i  to  site  j  -  he  cannot  remain  at 
one  place) . 

In  what  sequence  should  the  inspector  visit  the  sites  to  minimize  the  total 
distance  traveled? 

1 .  The  first  step  in  the  solution  algorithm  is  to  determine  the  mini¬ 
mum  distance  in  each  row  and  column  and  to  subtract  that  quantity  from  each 
of  the  other  elements  in  that  row  or  column: 


Table  17-9 


3  20 

4  5 

5  7 

6  18 


2 

3 

4 

11 

27 

0 

M 

15 

0 

13 

M 

35 

0 

9 

M 

41 

22 

43 

0 

0 

4 

Table 

17-10 

2 

3 

4 

11 

27 

0 

M 

15 

0 

13 

M 

35 

0 

9 

M 

41 

22 

43 

0 

0 

4 

0 

0 

0 

ROW  CONSTANTS 
16 
1 
0 

16 

5 

_5 

43 


[_6 _ 13 

Column  Constant  5 


Note  that  each  row  and  column  now  contains  at  least  one  "0".  The  distances 
subtracted  from  each  row  and  column  should  be  added:  43  +  5  ®  48.  This 
forms  a  "lower"  bound  on  the  set  of  all  tours: 


ALL 

TOURS 


2.  The  next  step  is  to  partition  this  set  of  all  tours  into  two  subsets. 
This  is  done  by  selecting  some  particular  branch  (i,j)  and  using  this  to 
partition  the  set  of  all  tours  into  a  subset  of  tours  containing  (i,j)  and 
a  complimentary  subset  of  tours  which  do  not  contain  the  branch  ( i ,  j  ) : 

Figure  17-28 


The  particular  branch  selected  as  the  basis  for  this  partitioning  is  deter¬ 
mined  by  calculating  opportunity  costs  or  penalties  for  each  0-entry  in  the 
table.  This  is  done  to  identify  the  additional  cost  or  penalty  incurred  if 
that  branch  (i,j)  designated  by  a  particular  zero  is  not  included  in  the 
circuit  or  tour. 

a.  The  penalty  for  each  0  entry  is  calculated  by  adding  together  the  next 
smallest  values  in  row  (i)  and  column  (j).  This  value  is  entered  in  the 
cell  ( i ,  j )  and  circled.  For  example,  for  the  0  entry  in  (1,4),  the  next 
smallest  entry  in  row  (1)  is  10.  Similarly,  the  next  smallest  entry  in 
column  (4)  is  0.  The  total  penalty  for  (1,4)  is  10+0*10.  The  penalties 
for  each  of  the  0  cells  are  computed  in  a  similar  fashion: 


1 

2 

3 

M 

11 

27 

1 

M 

15 

15 

13 

M 

5 

2 

6 

13 

5 

6 

14 

10 

29 

24 

-2 


1 


« 


b.  The  branch  having  the  largest  penalty  is  selected,  in  this  case  (1,4). 
All  of  those  tours  which  do  not  include  the  branch  (1,4)  will  incur  an 
additional  penalty  (distance)  of  10: 

Figure  17-29 


3 .  Since  we  have  examined  row  ( 1 )  and  column  ( 4 ) ,  these  can  be  eliminated 
from  further  consideration,  and  the  matrix  is  collapsed  by  removing  these: 


Table  17-12 


1 

2 

3 

5 

6 

2 

1 

M 

15 

29 

24 

3 

15 

13 

M 

5 

0 

D 

M 

0 

9 

2 

2 

5 

2 

41 

22 

M 

0 

6 

13 

0 

0 

0 

M 

In  addition,  we  need  to  take  some  action  to  preclude  "subtours",  i.e., 
selecting  a  branch  that  would  take  us  back  to  a  previously  connected  node 
before  all  n(6)  nodes  have  been  linked.  In  this  case,  we  need  to  insure 
that  we  don't  select  (4,1).  This  is  done  by  placing  an  "M"  in  that  posi¬ 
tion.  This  completes  one  iteration  of  the  algorithm. 

• 
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4.  To  determine  the  lower  bound  on  (1,4),  we  again  proceed  as  in  (1) 
above  by  "reducing"  the  matrix,  i.e.,  by  subtracting  minimum  row  and  column 
values  to  create  at  least  one  0  in  each  row  and  column: 


Table  17-13 
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Figure  17-30 
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for  those  cells  having  0  entries: 
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Select  branch  (2,1): 


Figure  17-31 


6.  The  matrix  Is  again  collapsed  by  eliminating  row  (2)  and  column  (1) 
To  prevent  subtours,  an  M  Is  Introduced  Into  cell  (4,2): 
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7.  We  reduce  this  matrix  to  determine  the  lower  bound  (2,1): 

Table  17-16 
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Determine  the  next  branch  by  computing  the  penalties 


Note  that  there  is  a  tie  for  the  cell  having  the  greatest  penalty,  i 
(3,5)  and  (6,2).  Arbitrarily  select  (3,5)  as  the  next  branch: 


ALL 


Figure  17-35 


Select  (6,3): 


Figure  17-38 


To  determine  the  lower  bound  on  (6,3)  we  collapse  and  reduce  the  matrix 
and  set  (3,6)  equal  to  M  to  preclude  a  subtour: 
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15.  At  this  point  we  know  that  the  subset  of  tours  which  does  not  in¬ 
clude  (1,4)  but  which  does  include  (6,3)  must  be  longer  than  63.  Since 
this  is  not  lower  than  the  bound  on  the  subset  containing  (3,5),  we  can 
return  to  that  subset  and  continue  the  procedure  by  determining  the  next 
branch: 
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Since  there  again  is  a  tie  for  the  cell  having  the  largest  penalty, 
arbitrarily  select  (4,3): 


63+M 


To  determine  the  bound  on  the  subset  containing  (4,3)  we  collapse  and 
reduce  the  matrix: 


Table  17-27 


The  reduced  matrix  contains  only  one  cell  (6,2)  having  a  0  entry. 
Therefore,  the  bound  on  this  subset  is:  63  +  0  *>  63 
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The  remaining  branch  (6,2)  is  automatically  determined •  The  complete  solu- 


The  optimal  circuit  is:  1-4-3-5-6-2-7 
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17.2  PROJECT  SYSTEM  MANAGEMENT 


A  project  is  a  purposeful  activity  directed  at  accomplishing  some  spe¬ 
cific  objective.  It  is  usually  an  activity  composed  of  a  number  of  constitu¬ 
ent  tasks  or  "subactivities."  These  component  tasks  are  usually  sequenced 
or  organized  by  specific  precedence  relationships.  Chase  and  Aquilano  (1973, 
p.  503)  suggest  that  "A  project  is  a  series  of  related  jobs  usually  directed 
at  some  major  output  and  requiring  an  extensive  period  of  time  to  perform." 

In  general,  a  project  can  be  thought  of  as  a  system  of  interrelated  discrete 
task  subsystems.  Both  the  system  and  its  constituent  subsystems  have 
finite  lifetimes.  This  characteristic  is  perhaps  the  most  distinguishing 
feature  of  a  project  system. 

The  term  "project  management"  refers  to  the  management  of  project 
systems.  More  specifically,  project  management  refers  to  the  making  (and 
taking)  of  decisions  concerning  the  processes  of  planning,  organizing, 
directing,  and  controlling  project  systems.  Particular  emphasis  is  given 
to  the  planning  and  controlling  processes,  with  the  term  planning  referring 
to  systems  analysis,  resource  allocation,  and  scheduling.  The  control  pro¬ 
cess  is  generally  concerned  with  identifying  operational  measures  for  pro¬ 
ject  system  tasks,  setting  associated  objectives,  and  monitoring  progress 
or  performance  in  meeting  these  operational  objectives. 

Project  System  Analysis 

The  term  "project  system  analysis"  refers  to  the  process  of  defining, 
analyzing,  and  measuring  the  various  elements  of  a  project  system,  i.e., 
the  component  tasks,  the  relationships  between  system  tasks,  and  the  behavior 
of  the  project  system  as  a  whole.  Operationally,  the  problem  of  project 
system  analysis  can  be  distilled  down  to  two  fundamental  questions: 

1.  What  do  we  want/need  to  do?  (DECISION) 

2.  How  will  we  evaluate  the  results  of  our  efforts?  (CONTROL) 

The  process  of  project  system  analysis  can  be  facilitated  by  using  a 
conceptual  framework  to  guide  cur  thinking.  The  framework  first  focuses  on 
the  DECISION  question: 

PURPOSE:  The  first  step  is  to  explicitly  define  the  purpose  of  the 
project  system.  The  purpose  is  simply  a  general  statement  of  what  we  are 
trying  to  do  or  accomplish,  e.g.,  construct  a  new  facility,  repair  a  mecha¬ 
nical  system,  remove  snow  from  the  base,  etc. 

KEY  RESULT  AREAS:  While  the  statement  of  purpose  provides  our  ana¬ 
lysis  with  a  general  direction,  it  is  usually  too  broad  to  serve  as  a 
meaningful  basis  for  management  action.  Consequently,  we  need  to  identify 
the  primary  subsystems  of  our  total  project  system.  These  primary  sub¬ 
systems  are  commonly  referred  to  as  Key  Result  Areas,  because  they  identify 
major  task  areas  in  which  we  must  insure  satisfactory  results  if  the  basic 


purpose  of  our  project  system  is  to  be  met.  In  a  facility  construction 
project/  for  example.  Key  Result  Areas  might  include  structural  systems/ 
civil  systems,  mechanical  systems,  electrical  systems,  etc.  The  rela¬ 
tionship  between  the  purpose  (system  level)  and  Key  Result  Areas  (primary 
subsystem  level)  is  a  simple  hierarchy: 
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EVENTS:  Applying  the  same  logic  to  each  primary  subsystem  or  Key 
Result  Area,  we  next  identify  the  major  tasks  or  "Events"  that  define 
satisfactory  accomplishment  of  the  particular  Key  Result  Area.  Events 
correspond  to  secondary  subsystems  or  major  program  milestones.  For 
example,  considering  the  mechanical  system  Key  Result  Area  in  the  facility 
construction  example,  we  might  define  the  completion  of  the  plumbing  system 
as  one  event  and  completion  of  the  heating  and  air  conditioning  system  as 
another  event.  The  relationship  between  Events,  Xey  Result  Areas,  and 
Purpose  is  illustrated  in  Figure  17-44. 
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TASKS /ACTIVITIES i  Finally/  we  naad  to  identify  those  individual 
component  tasks  we  need  to  accomplish  to  complete  or  realize  each  (parent) 
event*  Typically,  the  task  is  the  smallest  unit  of  activity  we  want  to 
manage,  i.e>,  the  greatest  level  of  resolution  (tertiary  subsystem).  The 
task  is  the  unit  to  which  resources  are  directly  applied,  and  the  level  at 
which  performance  is  measured  and  controlled.  Figure  17-45  illustrates  the 
complete  decision  structure. 
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The  number  of  levels,  or  degree  of  resolution,  will  vary  frost  project 
to  project  and  will  depend  on  the  degree  of  control  or  management  visibility 
required.  The  more  closely  we  want  to  oontrol  a  project,  the  greater  will 
be  the  number  of  levels  in  the  systems  analysis  structure.  Normally,  it  is 
not  too  meaningful  or  cost  effective  to  go  beyond  the  levels  of  resolution 
illustrated  here,  i.e.,  beyond  the  tertiary  subsystem  level. 


Closely  tied  to  these  DECISION  questions  Is  the  complementary  question 
of  CONTROL.  Essentially,  the  control  question  can  be  restated  as  "What  • 
evidence  will  we  accept  that  a  particular  task  has  been  completed  or  accom¬ 
plished  in  a  satisfactory  manner?"  This  becomes  a  question  of  measurement. 
More  specifically,  we  need  to  decide: 

a 

1 .  What  operational  scales  of  measurement  or  "performance  criteria” 
are  appropriate  for  a  particular  task  (e.g.,  time  allowed  for  task  comple¬ 
tion  in  man-days ,  the  compressive  strength  of  concrete  in  pel ,  etc . ) , 


2.  What  our  measurement  technology  and  methodology  will  be,  e.g., 
measurement  instrument,  measurement  frequency,  etc.,  and 


3.  What  level  of  performance  on  the  specified  operational  scale  or 
measure  is  "good  enough,”  i.e.,  what  our  operational  standard  of  perfor¬ 
mance  or  objective  is  (e.g.,  2.0  man-days  for  task  completion,  2,500  psi 
compressive  strength  at  seven  days  for  concrete ,  etc . ) • 


In  order  to  establish  appropriate  control,  and  to  be  consistent,  each 
system  task  should  have  at  least  one  and  very  often  many  operational 
measures  (with  associated  performance  standards).  In  many  of  our  projects, 
we  develop  extensive  plans  and  specifications  to  spell  out  exactly  the  per¬ 
formance  measures  to  be  used  and  the  standards  to  be  met  against  these 
measures.  For  many  other  projects,  our  performance  criteria,  operational 
measures,  and  operational  objectives  are  established  in  our  manuals,  regula¬ 
tions,  and  policy  letters. 


These  decisions  can  be  sunsarised  in  a  DECISION  and  CONTROL  MATRIX  for  a 
particular  project  system.  Notice  that  the  matrix  illustrated  in  Figure  17-46 
has  been  embellished  to  include  some  additional  management  information  of 
interest  to  the  project  system  manager,  e.g.,  the  OPR  for  each  task, 
resources  required,  current  status,  etc.  Completing  such  a  matrix  can  give 
the  manager  a  framework  for  systematically  analyzing  the  tasks  required  to 
complete  a  project  and  the  associated  measures  and  objectives  needed  to 
control  it. 


While  the  decision-control  matrix  facilitates  identification  of  project 
system  task  components  and  desired  system  behaviors  (as  expressed  in  the 
task  performance  criteria),  it  doesn't  provide  much  insight  into  the  rela¬ 
tionships  (technological,  economic,  organizational,  etc.)  between  various 
system  components.  Determining  these  relationships  depends  primarily  on 
the  knowledge,  experience,  and  perceptiveness  of  the  project  system  manager. 
However,  this  process  can  also  be  facilitated  by  constructing  a  matrix  of 
"precedence”  relationships. 


CRITERIA  1  CRITERIA 


In  the  matrix  illustrated  in  Figure  17-47,  various  symbols  (e.g.,  "T", 

"E" ,  and  "F*l  have  bean  used  to  denote  that  a  precedence  relationship 
exists  between  two  activities  and  the  nature  of  that  relationship*  For 
example ,  the  technological  constraint  or  relationship  indicated  by  the  "T* 
in  cell  (1.1.1,  1.1.3)  implies  that  for  technological  reasons,  task  1.1.1 
must  be  completed  before  task  1.1.3  can  be  started  (e.g.,  the  footing  must 
be  completed  before  a  wall  can  be  constructed).  The  economic  relationship 
indicated  in  cell  (1.2.1,  2.1.2)  implies  that  task  1.2.1  must  be  completed 
before  task  2.1.2  can  be  started  (e.g.,  both  tasks  cannot  be  accomplished 
concurrently  because  they  require  the  same  critical  piece  of  equipment  or 
key  Individual).  Obviously,  identification  of  the  key  relationships  bet¬ 
ween  system  components  is  critical.  When  such  relationships  are  not 
recognised  and  identified,  we  increase  the  risk  of  unforeseen  delays  and/or 
increases  in  cost. 

PROJECT  SYSTEM  SCHEDULING  MID  CONTROL 

Once  the  analysis  of  the  project  system  has  been  completed ,  i.e. ,  we've 
identified  all  of  the  component  tasks  which  need  to  be  controlled  and  their 
various  interrelationships,  we  complete  the  planning  process  by  identifying 
the  resources  required  and  by  developing  a  project  system,  completion  sche¬ 
dule.  These  stepe  are,  of  course,  closely  related  to,  and  integrated  with, 
the  system  analysis  phase  of  the  planning  process.  Completing  the 
decision-control  and  precedence  matrices  described  in  the  previous  section 
generally  require  the  manager  to  consider  the  resources  required,  as  well 
ae  those  available.  These  resource  considerations  often  influence  or 
determine  the  relationshipe  between  the  component  tasks  of  the  project 
system.  Very  often,  subeequent  thinking  about  available  and  required 
resources  forces  us  to  go  back  and  "fine  tune"  the  relationships  developed 
in  our  Initial  project  system  analysis. 

Having  completed  our  system  analysis  and  considered  the  availability 
and  allocation  of  required  resources,  we  also  need  to  develop  an  operating 
schedule  or  time-phased  plan  for  completing  the  project  and  accomplishing 
our  purpose.  Several  tools,  most  notable  the  bar  chart  and  the  activity 
network,  are  commonly  used  in  the  scheduling  process.  These  techniques  ar»- 
also  used  extensively  in  project  system  control.  Consequently,  these 
approaches  are  commonly  discussed  under  the  topics  of  planning  and/or 
control,  depending  on  the  perspective  of  the  respective  author. 

The  Bar  Chart 

_ The  bar  chart  (or  Gantt  chart)  is  perhaps  the-most-  widely  used- -project-  -  - 

system  scheduling  device.  A  typical  bar  chart  is  illustrated  in  Figure  17-48. 
This  chart  can  be  rather  easily  constructed  from  the  information  compiled 
in  the  project  system  analysis  phase  and  summarized  inthe  decision-control 
and  precedence  matrices  illustrated  in  Figures  17-46  and  17-47.  The "typical 
bar  chart  is  advantageous  in  that  it  is  relatively  easy  to  read  and  high¬ 
lights  activities  requiring  management  attention.  In  the  example  of 
Figure  17-48,  the  status  of  day  three  indicates  that  Task  1.2.1  is  in-progress 
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and  ahaad  of  schedule;  Task  1.1.1  has  been  completed  on  schedule;  Task 
1.1.2  Is  in -progress ,  but  behind  schedule;  and  Task  2.1.1  is  in-progress 
and  on-schedule.  This  basic  information  can  be  embellished  by  including 
other  useful  information,  e.g. ,  percent  complete  (for  each  task),  color 
coding  of  critical  activities,  etc.  On  the  other  hand,  the  bar  chart  has 
the  disadvantage  that  it  does  not  (normally)  clearly  show  all  of  the  task 
interrelationships.  For  example,  inspection  of  the  chart  in  Figure  17-48 
suggests  that  Task  2.2.2  must  also  be  proceeded  by  Tasks  1.2.1  and  1.1.2. 
This  limitation  can  be  offset  to  some  extent  by  an  additional  bookkeeping 
convention  or  notation  to  reflect  precedent  activities  (see,  for  example, 
the  bar  for  Task  2.2.2).  However,  this  procedure  tends  to  be  somewhat  bur¬ 
densome  for  large  projects. 

The  Project  System  network 

Perhaps  a  more  powerful  scheduling  and  control  technique  is  the  project 
system  network  or  activity  network  in  which  the  project  system  is  modeled 
as  a  network  or  graph  in  which  the  nodes  represent  events  and  incident  arcs 
represent  activities  which  must  be  completed  before  the  event  can  be  said 
to  be  completed.  A  typical  project  system  network  is  illustrated  in  Figure 
17-49.  The  project  system  network  has  the  advantage  of  more  clearly 
illustrating  the  interrelationships  between  various  system  components.  As 
such,  it  is  particularly  useful  to  the  manager  in  assessing  the  impact  of 
delays  in  the  completion  of  particular  tasks  or  conversely,  in  taking 
advantage  of  opportunities  created  by  the  early  completion  of  various 
system  tasks.  The  Program  Evaluation  and  Review  Technique  (PERT)  and  the 
Critical  Path  Method  (CPM)  are  two  commonly  used  project  system  management 
techniques  based  an  the  activity  network. 

The  project  system  network,  as  normally  constructed,  is  not  drawn  to  a 
time  scale  as  is  the  bar  chart,  posing  some  limitation  is  using  it  for  con¬ 
trol  purposes,  i.e.,  for  displaying  current  status.  It  is  possible,  however, 
to  construct  a  time-scaled  network  diagram,  as  shown  in  Figure  17-50. 

Organizing  and  Directing 

In  discussing  project  system  management,  there  seems  to  be  a  tendency  to 
concentrate  an  the  planning  and  control  processes,  while  giving  relatively 
less  emphasis  to  the  organizing  and  directing  processes.  This  happens  for 
a  number  of  reasons.  Very  often,  the  project  manager  doesn't  have  the 
authority  to  make  significant  changes  to  the  organization  structure  or  to 
make  desired  assignments  of  authority  and  responsibility  to  particular 
individuals.  Consequently,  there  is  a  tendency  to  treat  the  organization 
as  a  "given"  and  modify  the  plan  to  accommodate  this  constraint.  Even  if 
so  contrained,  however,  the  project  manager  should  at  least  consider  how 
the  organization  can  be  modified,  either  directly  or  indirectly,  to  facili¬ 
tate  project  system  accomplishment .  While  the  organizing  process  tends,  in 
general,  to  follow  the  planning  process,  these  management  activities  are 
obviously  interdependent.  In  developing  the  system  relationships  in  the 
analysis  phase  of  the  planning  process,  existing  (and  desired)  organiza¬ 
tional  relationships  must  be  considered. 


17-39 


Similarly,  success  in  implementing  the  project  system  plan,  through  the 
existing  or  modified  organization,  depends  on  how  well  the  project  manager 
communicates  the  purpose  and  the  details  of  the  plan  to  those  individuals 
responsible  for  accomplishing  the  various  component  tasks  in  the  project 
system.  The  directing/communicating  process  of  management  is  particularly 
critical  to  the  success  of  those  projects  in  which  close  coordination  is 
required.  Typically,  as  the  degree  or  amount  of  component  interaction, 
i.e.,  system  complexity  increases,  the  importance  of  effective  direction 
and  communication  also  increases.  In  a  sense,  the  directing  process  is 
particularly  important  in  that  it  is  through  this  management  process  that 
the  manager  "breathes  life”  into  the  system. 

An  additional  related  point  should  be  emphasized  concerning  effective 
project  management.  All  project  systems  depend  directly  or  indirectly  on 
the  actions  of  the  people  ooncemed  with  the  project.  Because  of  the  unique 
attributes  and  characteristics  of  human  resources,  the  project  manager 
needs  to  give  serious  consideration  to  motivational  factors  and  other  orga¬ 
nizational  behavior  consequences  associated  with  project  system  development 
and  implementation.  Very  often  the  difference  between  success  or  failure 
on  a  project  depends  to  a  great  extent  on  the  manager's  ability  to  involve 
and  motivate  the  people  involved.  In  fact,  quite  typical ly,  these  organi¬ 
zational  behavior  considerations  generate  the  most  difficult  problems  for 
the  project  manager  as  well  as  the  greatest  oportunities  for  success. 


The  general  process  of  project  system  management  is  illustrated  in  Figure 
17-51.  This  figure  attempts  to  reflect  the  dynamic  input-output  nature  of 
the  project  system  as  well  as  the  interrelated  nature  of  the  various  processes 
involved  in  management  of  the  system.  It  should  also  suggest,  at  least 
indirectly,  that  effective  project  management  is  a  very  complex  process, 
drawing  on  technical,  organizational ,  and  behavioral  management  skills. 
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PROJECT  SYSTEM  NETWORKS 


Basic  Definitions  and  Conventions 


1 .  An  ACTIVITY  is  any  portion  of  a  pro j act  which  consumes  time  or 
resources  and  has  a  definable  beginning  and  ending.  Activities  are  also 
referred  to  as  tasks  or  jobs.  Activities  are  graphically  represented  by 
directed  area  (arrows),  usually  having  the  associated  time  estimate  (and 
other  pertinent  information)  written  along  the  arc: 

Figure  17-52 

- 5 - 0 

2.  A  directed  arc  that  does  not  represent  the  consumption  of  time  or 
resources,  but  indicates  a  precedence  relationship  or  constraint  is  called 
a  DUMMY  ACTIVITY.  It  is  usually  represented  by  a  dashed-line  arrow: 

Figure  17-53 
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A  dummy  activity  can  also  be  represented  by  a  solid-line  directed  arc  with 
an  associated  zero  time  estimate: 

Figure  17-54 

- 2 - 1> 

3.  The  end  points  of  an  activity  arc  represent  EVENTS.  Theoretically, 
an  event  is  an  instantaneous  point  in  time.  Graphically,  events  are  repre¬ 
sented  by  nodes  or  circles.  If  an  event  represents  the  joint  completion  of 
more  than  one  activity,  it  is  called  a  MERGE  EVENT.  If  an  event  represents 
the  joint  initiation  or  more  than  one  activity,  it  is  called  a  BURST  EVENT: 


Figure  17-55 


BURST 


4.  An  activity  is  usually  identified  by  numbers  assigned  to  its 
beginning  and  ending  nodes: 

Figure  17-56 

C*) Activity  (1,  j!) 

By  convention/  and  to  facilitate  computer  modeling,  the  number  label 
assigned  to  node  (i)  is  always  smaller  than  the  number  label  assigned  to 
node  (j). 


5.  An  ACTIVITY  NETWORK  is  a  graphical  representation  of  a  project 
plan,  showing  the  project  components  (tasks  or  activities)  and  the  interre¬ 
lationships  (dependencies)  between  the  various  activities: 

Figure  17^57 


Basic  Network  Rules 


Rule  1.  Before  an  activity  may  begin,  all  activities  proceeding 
it  must  be  completed.  In  the  following  network  segment,  both  activities 
(1,  3)  and  (2,  3)  must  be  completed  before  (3,  4)  can  be  started: 


Figure  17-58 


Rule  2.  Arrows  (dlr*ct*d  ares)  represent  logical  precedence  only. 
Normally,  neither  the  length  of  the  arc  nor  Its  "compass"  direction  on  the 
graph  have  any  significance  (although  some  networks  are  time-scaled  to 
facilitate  display  of  control  information). 


Rule  3.  Event  numbers  must  not  be  duplicated  in  a  network  (i.e., 
loops  are  permitted): 

Figure  17-59 


NOT  PERMITTED 


Rule  4.  Any  two  events 
one  activity: 


may  be  directly  connected  by  no  more  than 
Figure  17-60 


Rule  5.  Networks  may  have  only  one  initial  event  (i.e.,  ORIGIN  or 
SOURCE)  and  only  one  terminal  event  (DESTINATION  or  SINE).  It  is  often 
desirable  to  represent  the  origin  and  destination  nodes  by  distinctive 
symbols: 

Figure  17-61 


ORIGIN  DESTINATION 


(SINX) 


Bas le  Scheduling  CflMgtg 


Ones  th«  project  system  baa  been  analyzed  to  determine  the  component 
activities  and  their  interrelationships ,  we  can  construct  the  project  network. 
The  next  step  (usually  done  concurrently  with  network  development)  is  to 
estinats  the  tiiM  required  to  complete  each  activity.  The  manner  in  which 
these  tine  estimates  are  determined  is  one  of  the  essential  differences 
between  ?SRX  and  CPM.  Briefly#  PERT  incorporates  probability  in  the  esti¬ 
mate  of  activity  times f  i.e. *  it  considers  the  estimated  completion  time  to 
be  soma  random  variable#  while  CPM  amploys  deterministic  estimates,  to 
introduce  soma  basic  scheduling  concepts#  we  will  first  adopt  the  deter¬ 
ministic  assumption.  To  facilitate  this  discussion#  consider  the  project 
system  network  of  Figure  17-63.  The  number  associated  with  each  activity 
arrow  is  the  deterministic  time  estimate.  Suppose,  as  the  project  manager# 
you  are  interested  in  determining  the  total  pioject  completion  time,  i.e. , 
the  earliest  time  all  activities  in  the  network  can  be  completed.  This  is 
equivalent  to  finding  the  EARLIEST  TIME  for  each  event.  Since  an  event 
cannot  occur  until  all  of  the  incident  activities  terminating  at  the  event 
have  been  completed#  the  earliest  time  of  event  completion  is  given  by: 

Ej  »  max  (E^  +  t^j) 

where  tjj  is  the  estimated  activity  completion  time  for  (i#  j).  Normally, 

Ej  is  indicated  OVER  the  node  to  which  it  applies : 


Figure  17-62 
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As  shown  la  Figure  17-64,  th«  earliest  time  is  16  units.  Saving  determined 
ths  ssrlisst  svan t/pro j set  collation  tins,  it  is  also  of  intsrsst  to  find 
the  LATEST  TZMS  for  svsnt  and  project  completion.  This  is  ths  latsst  tine 
s  particular  svsnt  can  occur  without  delaying  completion  of  ths  projsct  (in 
this  csss,  beyond  16  units).  The  latsst  completion  time  for  each  svsnt  is 
determined  by  working  "backwards”  through  ths  network  from  ths  sink  to  ths 
source  nodes.  Ths  latsst  time  is  given  by: 


min  (Lj  ~  tAj) 


normally,  ths  latsst  time  is  written  under  ths  nods. 


For  example: 


Figure  17-65 


Node  8  (Sink) 
H  -  *i 

Lg  »  Eg  ■  16 


As  another  example: 

Node  5 

L5  -  L?  -  t5 
-  13  -  0 
L5  -  1^  -  ts 
>16-8 


Inspection  of  the  network  shows  that  for  certain  events  (nodes),  E,  -  L., 
the  earliest  and  latest  event  completion  times  are  the  same.  This  means 
that  there  is  no  "SLACK  TIME"  available.  If  the  event  is  not  completed  on 
time,  the  project  completion  time  will  also  slip.  For  other  events,  the 
latest  time  exceeds  the  earliest  time.  At  these  events,  a  slip  in  the 
event  completion  time  of  14  -  E*  can  be  incurred  without  impacting  the 
completion  time  of  the  project.  This  difference,  l.e.,  I^  -  S^,  is  called 
EVENT  SLACK.  Events  having  saro  slack  are  said  to  be  CRITICAL.  The  path 
through  the  network  from  the  source  to  the  sink  connecting  critical  event*-, 
is  called  the  CRITICAL  FATE.  It  is  the  longest  path  through  the  network 
(in  this  case:  0-1,  1-4,  and  7-8).  The  activities  dsfialng  this  path 
(0-1,  1-4,  4-7,  and  7-8)  are  called  CRITICAL  ACTIVITIES.  This  information 
can  be  summarised  in  a  table  to  facilitate  project  system  management  and 
control.  An  example  is  presented  in  Table  17-28.  The  table  is  particularly 
useful  in  that  it  highlights  the  critical  activities  and  events  and  indi¬ 
cates  which  activities  can  be  slipped  without  impact  on  the  project  comple¬ 
tion  time.  Figure  17-66  summarises  these  computations. 
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Figur*  17-66 


17.3  PROGRAM  EVALUATION  AND  REVIEW  TECHNIQUE  (PERT) 


5 

9 

i 


PERT  was  developed  by  a  team  of  consultants  from  Lockheed,  the  Navy,  and 
the  firm  of  Booz,  Allen  and  Hamilton  in  1958  (during  the  Polaris  project). 

It  was  developed  to  help  project  managers  deal  with  an  extremely  critical 
time  constraint.  Prior  to  that  time,  completion  times  on  similar  projects 
commonly  averaged  30-40%  greater  than  the  earliest  time  estimates,  while 
costs  commonly  ran  two-three  times  the  earliest  estimates.  Perhaps  the 
most  distinctive  feature  of  PERT  is  its  statistical  treatment  of  the  esti¬ 
mated  activity  time,  i.e.,  the  activity  time  estimate  is  treated  as  a  ran¬ 
dom  variable.  It  relaxes  the  assumption  of  certainty  used  in  CPM  and  other 
project  management  techniques.  Activity  times  are  represented  by  a  mean 
and  associated  variance.  By  employing  a  statistical  approach,  the  project 
manager  can  determine  the  probability  of  completing  a  particular  activity 
or  event  by  some  scheduled  completion  date  or  other  date  of  interest.  To 
estimate  the  activity  mean  and  variance,  PERT  uses  the  following  three  time 
estimates  for  each  activity: 

a^j  ■  OPTIMISTIC  TIME:  The  minimum  reasonable  time  in  which  the  activity 
can  be  completed  (there  is  only  a  0-5%  chance  that  the  activity  can  be 
completed  in  a  shorter  period  of  time) 

n»ij  -  MOST  LIKELY  TIME:  The  best  guess  of  the  time  required  to  complete 
the  activity.  This  would  be  the  estimate  used  if  the  deterministic  assump¬ 
tion  were  made.  This  is  the  modal  value  of  the  distribution  of  activity 
times. 

bij  -  PESSIMISTIC  TIME:  The  maximum  reasonable  period  of  time  in  which 
the  activity  can  be  completed  (there  is  only  a  0-5%  chance  that  the  activity 
will  take  a  longer  period  of  time  to  complete). 

In  PERT,  the  estimated  activity  completion  time  is  assumed  to  have  a  BETA 
distribution  (unless  other /historical  information  suggests  otherwise).  The 
BETA  distribution  was  selected  because: 


1 .  it  is  extremely  flexible 

2.  it  can  take  on  the  variety  of  forms  that  typically  arise  in  pro¬ 
ject  activity  durations 


3.  it  has  finite  end  points  which  limit  the  possible  activity  times 
to  the  area  between  the  aj.j  and  bj.j  values,  and  primarily 


4.  in  the  simplified  version  used  in  PERT,  it  permits  the  straight' 
forward  computation  of  the  mean,  variance,  and  standard  deviation. 


The  mean  is  given  by: 

The  variance  is  given  by:  V^j 


aij  +  *®ij  ♦  bij 
6 

<bij  “  aij)2 
36 
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To  compute  the  EARLIEST  and  LATEST  event  times  under  PERT,  we  proceed 
as  before ,  but  use  the  te  values  Instead  of  the  deterministic  activity  time 
estiaatet 


and 


max 

i 


(E, 


*•11 } 


*  min  (L.  - 


Te  } 
eij 


Again  the  SLACK  for  event  (i)  is  given  by  Li  -  Si*  It  should  be  noted  that 
these  computations  assume  that  the  activities  in  the  network  are  statisti¬ 
cally  independent*  Just  as  the  earliest  event  .time  is  given  by  the  sum  of 
the  te  values  along  the  longest  path  from  the  origin  to  the  event  node,  the 
variance  in  that  event  time  is  given  by  the  sum  of  the  associated  activity 
variance  values  along  the  same  path.  Consider  the  following  network! 


Figure  17-67 


Wa  cooputa  tha 


(t#)  and  varianca  (V)  with  tha  raspactiva  formula: 
-  b  and  V  -  SkzJ ± 


Tabla  17-30 


rigura  17-68  show*  tha  natwork  with  all  actl~itiaa  and  avants  indicating 
thair  raapactiva  naan  valuas  and  aasociatad  variancaa  aa  (  ,  V) . 
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Because  PERT  treats  the  estimated  activity  time  as  a  random  variable, 
it  is  possible  to  make  statements  about  the  probability  of  completing  an 
event  on  or  before  some  scheduled  date*  By  employing  the  Central  Limit 
Theorem,  we  can  assert  that  the  actual  time  of  event  occurrence  is  NORMALLY 
distributed  with  a  mean  of  and  a  variance  of  •  Therefore ,  if  we 
define  Si  as  some  scheduled  date  for  event  completion,  then: 


Si  -  Ei 


Suppose,  for  example,  the  project  manager  wants  to  know  the  probability 
of  completing  the  project  system  illustrated  in  Figure  17-68  in  18  days  or 
less*  We  have  previously  determined  that  2^-15  and  V4  -  5*34.  Therefore 


18  -  15 


2.311 


»  1.298 


Figure  17-69 


Q.4Q29 


1.298 


Therefore,  for  S4  -  18,  we  can  assert  that  P(*4  S  18)  ■  0.500  +  Q.4029  " 
0.9024,  or  there  is  a  .  90%  chance  that  the  project  will  be  completed  in  18 
days  or  less. 


CPM  and  Time  -  Cost  Analysis 

The  original  versions  of  PERT  and  CPM  differ  in  at  least  two  important 
ways: 

1.  First/  CPM  assumes  that  activity  times  are  deterministic* 

2.  Second,  rather  than  focusing  primarily  or  even  exclusively  on  the 
time  factor,  CPM  places  equal  emphasis  on  both  time  and  cost. 

The  network  scheduling  logic  is  the  same  for  both  CPM  and  PERT.  The 
emphasis  on  EARLY/LATE  START  and  EARLY/LATE  FINISH  times  for  individual 
activities  comes  from  CPM. 


CPM  assumes  that  there  is  a  relationship  between  the  time  it  takes  to 
complete  an  activity  and  the  cost  for  that  activity,  i.e.,  the  completion 
time  can  be  reduced  by  some  amount  by  allocating  more  resources  to  the 
activity.  In  most  cases,  this  relationship  is  assumed  to  be  linear: 


Figure  17-70 


ACTIVITY 

COST 


NORMAL 


CRASHED 


Th«  slop*  of  tfa*  tin*  -  cost  function  expresses  the  MARGINAL  EFFECTIVENESS 
or  UTILITY  of  allocating  additional  resources  to  reduce  the  activity  tine. 
Using  the  notation  introduced  in  the  above  table/  the  slop*  Is  given  by: 

CCij  “  NCij 

SiJ  ’  STij  -  cr13 

Consider/  for  example,  the  system  project  data  summarized  in  the  following 
table: 


Table  17-31 


ACTIVITY 

NTij 

CTij 

NCij 

V 

CRITICAL 

0  -  1 

2 

1 

10 

13 

3 

0-2 

1 

1 

6 

6 

- 

X 

0-3 

4 

2 

25 

33 

4 

1  -  3 

1 

1 

7 

7 

- 

1  -  4 

5 

2 

30 

36 

2 

2-3 

2 

1 

12 

19 

7 

2-5 

7 

3 

35 

55 

5 

X 

3-4 

3 

1 

15 

17 

1 

3-5 

2 

1 

8 

14 

6 

3-6 

3 

1 

10 

16 

3 

4-6 

4 

2 

14 

18 

2 

5-6 

6 

2 

40 

56 

4 

X 

Under  normal  conditions  then,  the  critical  path  isO-2-5-6  with  a 
normal  total  time  of  14  and  a  normal  total  cost  of  212. 


It  seems  apparent  that  if  the  project  manager  wants  to  expedite  the 
project  completion  by  allocating  additional  resources,  the  first  activities 
to  be  considered  are  those  currently  on  the  critical  path,  i.e. ,  those  that 
are  currently  constraining  the  project: 

ACTIVITY  S _  i 


0-2  -  (can't  be  crashed) 

2-5  5 

5-6  4 

Of  these  critical  activities,  we  would  want  to  crash  activity  5-6  first 
because  it  has  the  least  expensive  unit  cost  64.  But  how  far  can  activity 
5  -  6  be  crashed,  i.*.,  all  the  way  from  6  to  2,  or  only  some  portion  of 
the  total  "crash  potential"'’  He  crash  the  most  cost-effective  or  efficient 
critical  activity  until  it  is  no  longer  critical,  or  by  the  entire  crash 
potential  amount  if  the  activity  remains  critical.  In  this  particular 
case,  activity  5-6  can  be  reduced  by  three  time  units  at  an  additional 
cost  of  3  6  4  -  12.  At  that  point,  activity  4-6  becomes  critical  and  can 
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Pigur*  17-72 


be  crashed  at  a  cost  of  2/time  unit.  To  determine  the  earliest  possible 
completion  time  under  a  "full  crash"  program,  we  simply  continue  to  crash 
the  most  cost-effective  critical  activity  until  it  is  no  longer  critical. 
For  even  moderate-sized  project  networks,  it  is  difficult  to  systematically 
crash  a  project  system  without  the  assistance  of  a  computer  code.  Figure 
17-72  summarizes  a  full  crash  program  for  this  project.  Note  that  it  is 
not  necessary  to  crash  every  activity  by  the  full  amount  in  order  to 
achieve  the  maximum  possible  reduction  in  completion  time.  In  this  case, 
crashing  the  network  from  a  total  cost  of  277  to  290  doesn't  result  in  any 
further  reduction  in  project  completion  time. 
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CHAPTER  18 


INVENTORY  MODELS 


INTRODUCTION 

This  chapter  presents  the  basic  models  developed  for  the  management  of 
Inventory  where  Inventory  will  be  referred  to  as  the  stock  of  any  indepen¬ 
dent  demand  Item  used  in  the  conduct  of  an  organization 1 s  mission  or  opera¬ 
tion.  Inventory  serves  many  purposes,  chief  of  which  include:  decoupling 
successive  stages  in  production,  distribution  and  consumption  processes; 
protection  against  variations  in  item  demand  and/or  delivery  schedules; 
and  reduction  of  aggregate  inventory  costs  through  establishment  of 
economic  purchase  order  replenishment  levels. 

Inventory  models  are  used  to  assist  inventory  managers  in  determining 
when  items  should  be  ordered  and  how  large  the  order  should  be  so  as  to 
minimize  relevant  variable  costs,  while  ensuring  that  adequate  stocks  are 
on  hand  to  meet  mission  requirements. 

EOQ  MODEL 

The  classic  economic  order  quantity  or  EOQ  model,  which  serves  as  the 
foundation  for  the  more  complex  but  realistic  inventory  models,  is  used 
under  the  following  ideal  conditions^-: 

1.  Demand  per  time  period  is  known  with  certainty  and  is  at  a  constant 
(linear)  rate. 

2.  Lead  time  is  known  with  certainty  and  equals  zero. 

3.  Stockouts  are  not  permissible . 

4.  Replenishment  is  instantaneous. 

5.  Order  quantities  are  always  the  same  size  and  are  not  restricted  to 
being  discrete. 

6.  Unit  cost  for  items  is  constant  with  no  discounts. 

7.  Single-item,  single  echelon  inventories  are  assumed. 

8.  An  infinite  planning  horizon  is  assumed. 

9.  Demand,  lead  time  and  costs  are  fixed  over  time. 


1 These  conditions,  or  assumptions ,  and  the  notation  used  in  this 
chapter  are  taken  from  Frank  S.  Budnick,  Richard  Mojena,  and  Thomas  E. 
Vollmann,  Principles  of  Operations  Research  for  Management  (Homewood,  II: 
Richard  D.  Irwin,  Inc.,  1977),  Chapter  11. 
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The  objective  of  the  EOQ  model  is  to  detexmine  the  optimum  economic 
order  quantity  (Q*)  and  number  of  orders  (N*)  to  place  per  given  time 
period  so  as  to  minimize  total  inventory  costs  given  by: 


Total  Cost  ■ 


Ordering  Cost 
per  Time  Period 


+ 


Carrying  Cost 
per  Time  Period 


+  Purchase  Cost 


or 


££  .  2jQ 
Q  2 


+  PD 


(1) 


where  TC  -  Total  inventory  cost 

D  »  Demand  per  time  period 
C0  *  Order  cost  per  order 
Q  -  Order  quantity 

Ch  -  Carry  or  holding  cost  per  unit  per  time  period 
P  -  Purchase  price  per  unit 


The  behavior  of  the  inventory  cost  components  is  illustrated  in  figure  1. 


Figure  1. 


Inventory  Costs 


To  minimize  the  total  cost  equation,  we  take  the  first  derivative  of  the 
expression  with  respect  to  Q,  set  the  result  equal  to  zero  and  solve  for  Q: 


Note  that  the  purchase  cost  component  falls  out  of  the  final  result  since 
the  price  per  unit  was  assumed  fixed  with  respect  to  the  order  quantity. 
Other  important  results  for  this  model  are: 


where  N*  *  the  optimum  number  of  orders  to  place  per  time  period 

t*c  ■  the  inventory  cycle  time  or  time  between  order  replenishment 
and  depletion. 

EXAMPLE  18-1. 

under  the  consolidated  administrative  airlift  program,  109  T39 
aircraft,  operated  by  MAC,  are  located  throughout  the  CONUS  to  provide 
pilot  training  and  passenger  movement  capability .  Main  gear  tires  for 
these  aircraft  are  procured  and  managed  at  the  Ogden  Air  Logistics  Center. 
The  item  manager  is  interested  in  establishing  an  inventory  policy  for 
these  tires  (that  is,  how  many  tires  to  order  each  time  an  order  is  placed 
and  how  many  orders  should  be  placed  each  year) .  The  annual  requirements 
are  estimated  at  2250  tires.  It  costs  $25  to  place  an  order  and  the  cost 
of  carrying  one  tire  in  inventory  for  one  year  is  $5.  Lead  time  is  zero 
days  since  the  supplier  can  deliver  tires  to  the  Ogden  ALC  on  the  order 
date.  The  purchase  cost  is  $50  per  tire. 


For  this  problem,  Q*  is  determined  from  equation  (2) : 


B.  .jnnnnEi  .  150tir.. 


The  optimum  number  of  orders  to  place  is  given  by  equation  (3) : 


N* 


2250 

150 


15  orders  per  year 


The  cycle  time  is  provided  from  equation  (4) : 


_  _  150 

Tc  2250 


.067  year  or  24.333  days 


The  total  inventory  cost  of  this  policy  is  given  by  equation  (1) : 
TC  »  25{|f'5-)  ♦  -7°--  +  50(2250)  «  $113,250 

EOQ  Model  with  Deterministic  Lead  Time 


In  most  inventory  systems,  the  time  between  order  placement  and  receipt 
is  not  zero  days,  as  assumed  in  the  classic  EOQ  model.  As  such,  order  place¬ 
ment  must  be  timed  so  that  the  order  arrives  when  the  inventory  on  hand 
is  depleted.  The  point  in  time  at  which  the  order  is  made  is  referred  to 
as  the  reorder  point  which  is  commonly  expressed  as  a  predetermined  inven¬ 
tory  level.  With  deterministic  lead  times,  the  reorder  point  equation  is: 


where  tj,  ■  lead  time 

0  *  demand  (in  lead  time  dimensions) 

D  •  t^*  demand  during  lead  time 

integer  part  of  ti/tc  or  the  number  of  complete  cycles  during 
a  lead  time  period. 


EXAMPLE  18-2 


For  the  aircraft  tire  problem,  suppose  that  the  lead  time  is  30  days, 
that  is,  it  takes  30  days  from  the  time  the  order  is  placed  until  the  order 
arrives.  From  equation  (5)  we  have: 
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.  2250  tires 
365  days 


30  days 


150  tires 


[~  30  days 
[24.333  days 


-  184.9  tires  -  (1) (150  tires) 

-  34.9  tires 


the  optimal  policy  is,  therefore,  to  place  an  order  for  150  tires  (Q*) 
when  the  inventory  on  hand  drops  to  35  tires. 

EOQ  Model  with  Backorders 


The  classic  EOQ  model  can  be  extended  to  explicitly  consider  the  case 
where  shortages  are  permissible.  With  shortages  or  backorders  allowed,  all 
demands  will  continue  to  be  satisfied  but  not  in  the  inventory  cycle  in 
which  they  were  made.  Although  difficult  to  measure,  a  backorder  cost 
should  be  included  in  the  total  inventory  cost  equation  and  policies  developed 
so  as  to  balance  this  cost  against  the  order  and  carry  cost  components. 

The  total  cost  equation  can  be  expressed  as: 


TC  - 


CpD 

Q 


+ 


(Q-S)2  Ch  S2Cs 
2Q  +  2Q 


PD 


(6) 


where  S  **  units  short  that  must  be  backordered  during  each  inventory  cycle. 
Cs  •  shortage  (backorder)  cost  per  unit  per  time  period. 

The  policy  variables,  Q*  and  S* ,  are  determined  by  differentiating  the 
total  cost  equation  with  respect  to  Q  and  s,  setting  the  resulting  partial 
derivatives  equal  to  zero  and  solving  for  Q*  and  S*  simultaneously.  The 
results  are: 


(7) 


(.81 


Other  results  for  the  EOQ  model  with  back  orders  are: 
L*  m  q*  -  s* 


(.91 


*1  “ 


ill 

D 


G101 
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R*“  “t-LVI  c*-s* 


where  L*  *  real  (maximum)  Inventory  level 

-  time  during  an  inventory  cycle  when  inventory  is  held. 
t2  *  time  during  an  inventory  cycle  when  inventory  is  backordered. 

EXAMPLE  18-3. 

For  the  aircraft  tire  problem,  suppose  that  backorder  costs  are  $40 
per  tire  per  year.  The  policy  variables  and  other  results  are  as  follows: 


1/2(25) (2250) 

/ 5+40 \ 

V  5 

\  40  / 

f 2(25) (2250) (5) 
(5) (40) +(40) ^ 


«*  2250 

N  "  155 7T 


159.1 

2250 


159.1  - 

141.4 

2250 


17.7 

2250 

2250  tires 
365  days 


—  \~40~j  "  159.1  tires  in  each  order 

^  ■  17.7  tires  backordered  each  cycle 

(40)  2 

m  14.1  orders  per  year 

*  .0707  year  or  25.8  days 

17.7  *  141.4  tires 

■  .0628 .year  or  22.9  days 

■  .0079  year  or  2.9  days 


30  days 


.[12. 

Las. 


days 
8  days. 


159.1  tires  -  17.7  tires 


184.9  -  159.1  -  17.7 


8  tires 


25(2250) 

159.1 


5(159.1  -  17.7)2 
2  (159.1) 


2  (159.1) 


+  50(2250) 

-  $113,207.11 

The  results  suggest  that  approximately  14  orders  should  be  placed  each 
year,  each  consisting  of  159  tires.  The  number  of  backorders  planned  for 
each  cycle  are  18  tires  and  an  order  should  be  placed  when  the  inventory 
drops  to  8  tires.  Mote  that  orders  placed  in  cycle  k  arrive  in  cycle 
k+2.  The  total  number  of  backorders  per  year  are : 


17.7  (14.1) 


249.57  units 


and  the  total  days  that  backorders  exist  during  the  year  are: 
t2N*  »  2.9  (14.1)  -  40.9  days 


Model  with 


itity  Discounts 


For  certain  items,  suppliers  offer  price  discounts  if  buyers  make 
purchase  orders  of  large  quantities.  With  these  discounts,  per  unit 
purchasing  cost  and  total  ordering  costs  decrease  while  carrying  costs  will 
increase.  The  following  example  will  illustrate  a  simple  total  cost  com¬ 
parison  procedure  for  determining  if  quantity  discounts  should  be  taken.- 

EXAMPLE  18-4 


Suppose  for  the  aircraft  tire  problem,  the  supplier  offers  the 
following  discount  schedule: 

Purchase  Price 

tity  Discount  per  Unit 
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TC 


25(2210!  +  SfigOL  +  50(225Q) 


$113,250 


The  next  step  Is  to  recompute  the  total  Inventory  cost  using  the  next 
lower  discount  price  of  $49.00.  Note  that  the  only  way  total  costs  can 
decrease  is  if  the  buyer  purchases  the  minimum  quantity  Q  associated  with 
the  2%  discount  because  of  the  shape  of  the  total  cost  curve.  With  Q  *  200 
and  P  ■  $49.00,  the  total  inventory  cost  from  equation  (1)  is: 

tc  -  35I|||02.  +  !I|gQL  +  49(2250)  -  $111,031.25 

The  above  step  is  repeated  through  the  entire  discount  schedule.  For  the 
3%  discount,  Q  is  set  equal  to  800  and  P  =  $48.50.  The  total  cost  for  this 
policy  is: 

TC  *  ~80o"°"  +  5 ' ' f" ~  +  48.5(2250)  -  $111,195.31 


The  last  step  is  to  select  that  policy  which  corresponds  to  the  minimum 
total  cost.  For  this  example,  the  minimum  total  cost  is  $111,031.25  which 
is  associated  with  an  order  quantity  of  200  tires  and  a  purchase  price  of 
$49.00. 

EOQ  Model  with  Stochastic  Demand 

In  most  inventory  systems,  including  those  used  in  the  USAF,  the  demand 
and  lead  time  parameters  are  stochastic  rather  than  deterministic,  as 
assumed  by  the  classic  EOQ  model.  For  the  fixed  order  quantity  (EOQ)  model 
without  backorders,  demand  variability  during  lead  time  and/or  lead  time 
variability  could  result  in  an  unprogrammed  shortage  occurring  prior  to 
inventory  replenishment.  To  protect  against  these  shortages,  a  safety 
level  quantity  or  safety  stock  is  added  to  the  demand  during  lead  time 
quantity  for  computation  of  the  reorder  point.  The  safety  stock  is  commonly 
established  as  a  policy  variable  derived  from  management  assessment  of  the 
risk  of  incurring  backorders  and  knowledge  of  the  distribution  of  demand 
during  lead  time  parameter . 

This  section  will  discuss  the  stochastic  demand  during  deterministic 
lead  time  situation.  It  is  recognized  that  lead  time  may  also  be  stochastic 
although  there  are  some  management  controls  over  this  parameter  to  reduce 
the  uncertainty.  If  both  demand  during  lead  time  and  lead  time  are 
stochastic,  the  distribution  of  demand  during  lead  time  is  difficult  to 
mathematically  determine  since  a  convolution  of  the  two  parameters  is 
required  to  generate  the  function.  However,  estimates  of  this  distribution 
can  be  made  by  experimentally  observing  the  interaction  of  demand  and  lead 
time  through  a  simulation  model  of  the  two  processes. 

The  EOQ  model  and  reorder  point  equations  are  modified  as  follows  to 
incorporate  stochastic  demand  during  deterministic  lead  time: 
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where  d 


average  daily  demand  determined  from  the  demand  distribution 
carry  cost  per  unit  per  day 


Once  Q*  is  computed,  the  reorder  point  is  computed  independently  as  follows 
(this  equation  assumes  that  demand  during  lead  time  can  be  described  by 
a  normal  distribution) . 


R* 


dL  +  2l-a 


% 


(15) 


where  dj,  =  average  demand  during  lead  time 

*  the  standard  deviation  of  demand  during  lead  time 

Z 

1— a  =  number  of  standard  normal  deviates  needed  to  achieve  a 
cumulative  probability  (service  level)  of  1-a 

Zl-a°d  =  safetY  stock  or  safety  level  quantity 


EXAMPLE  18-5. 


For  the  aircraft  tire  problem  suppose  that  the  average  daily  demand  is 
now  6  tires  per  day,  the  order  cost  is  $25  per  order  and  the  carry  cost  is 
$5  per  unit  per  year.  The  economic  order  quantity  is  computed  from 
equation  (14)  (note  that  the  per  unit  order  cost  is  divided  by  365  to 
convert  all  dimensions  to  days) : 


147.99  or  148  tires 


This  result  suggests  that  each  order  will  be  for  148  tires  and 


N* 


d  x  365 
148 


14.8  or  15  orders  will  be  placed  per  year. 


For  computation  of  the  reorder  point,  assume  that  management  establishes 
a  stock  out  risk  of  10%.  That  is,  if  100  random  observations  were  taken 
during  the  resupply  time  period,  in  10  of  those  observations  at 
least  one  shortage  would  be  expected.  The  conqplement  of  the  stock  out  risk 
is  90%  (1-10%)  which  is  referred  to  as  the  service  level,  that  is,  in  90 
out  of  100  random  observations  taken  during  the  resupply  time  period,  no 
backorders  or  stock  outs  would  be  expected  to  incur.  Thus  a  *  .10  and 


The  following  information  has  been  collected  for  computation  of  dL  and 


°d_ 


Class  Intervals  for 

Class  Mark: 

Frequency  of 

Demand  During 

Lead 

Demand  During  Lead 

Occurrence 

Time 

Time  (dLi) 

F(dL.) 

7  but 

under 

9 

8 

.10 

9  but 

under 

11 

10 

.20 

11  but 

under 

13 

12 

.40 

13  but 

under 

15 

14 

.20 

15  but 

under 

17 

16 

.10 

average 

demand  during  lead  time  (mean  of  a  frequency  distribution)  is 

computed  from  the  following  statistical  formula: 

»  ?  dLi  f(dL.) 

=  8  ( .10)  +  10  ( .20)  +  12  ( .40}  +  14  ( .20)  +  16(.l0)  =■  12 

The  standard  deviation  of  demand  during  lead  time  ( standard  deviation  of  a 
frequency  distribution)  is  computed  from  the  following  statistical  formula: 

o£  (dL  -  dL)2  f(dL  ) 

L  L1  1  • 

[(8-12)2(.10)  +  (10-12)2(.20)+(12-12)2{.40)  +  (14-12)2(.20)  +  (16-12)2(.10)J  *  =  2.! 

Given  a  *  .10,  then  Z^_a  *  Z  gQ  *  1.282  and  the  reorder  point  is  determined 
from  equation  (15) : 

R*  *  12  +  1.282  (2.19)  *  14.8  or  approximately  15  tires. 


Therefore,  an  order  for  148  tires  will  be  placed  each  time  the  inventory 
falls  to  15  units  on  hand.  Note  that  the  safety  stock  is  2.8  or  approximately 
3  tires  which  is  expected  to  provide  a  service  level  of  90%  or  alternatively, 
with  a  safety  level  of  3  tires,  the  probability  of  having  less  than  zero 
tires  on  hand  when  am  order  is  received  is  less  than  10%. 

Periodic  Review  Inventory  Systems 

The  models  discussed  up  to  this  point  are  used  in  continuous  review 
inventory  systems  where  on  hand  inventory  is  continuously  reviewed  and  an 
order  is  placed  for  a  fixed  quantity  Q*  when  the  on  hand  level  is  observed 
to  reach  the  reorder  point  R*.  Periodic  review  inventory  systems  require 
the  review  of  on  hand  inventory  levels  every  tc  time  units  at  which  time 
an  order  is  placed  for  a  variable  quantity,  q,  to  bring  the  inventory  level 
up  to  a  predetermined  replinishment  level,  M. 
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1 


For  the  periodic  review  inventory  system  with  stochastic  demand  during 
deterministic  lead  time,  Q  is  computed  from  equation  (14)  and  tc  is  computed 
as  follows: 


From  the  example  problem  presented  in  the  last  section,  this  equation 
gives : 

tc  -  ~  -  24.67  days 

The  periodic  review  system  requires  more  safety  stock  than  the  con¬ 
tinuous  review  system.  Safety  stock  for  the  continuous  review  system  was 
computed  using  the  probability  distribution  of  demand  during  lead  time. 
Safety  stock  for  the  periodic  review  system,  however,  is  computed  using  the 
probability  distribution  of  demand  over  a  longer  time  period,  namely,  lead 
time  plus  the  review  period.  The  replenishment  level,  which  accounts  for 
this  increased  safety  stock,  is:2 

M  =  d  +2.  or  (17) 

t.+te  l-o  d  .  t 

L  C  V  c 


where 


V*c 


d(tL  +  tc) 


z.  a— 
l-o  d. 


safety  stock 


tL+tc 


dtL«c  d  L  ° 


To  illustrate  the  application  of  equation  (17) ,  let  the  average  daily 
demand  be  6  units  per  day,  the  lead  time  be  2  days  and  the  standard 
deviation  of  average  daily  demand  be  1  unit  per  day  for  the  example 


problem.  If  o  *  .10,  z^_a 


1.282  and  the  replenishment  level  is: 


M  =  6(24.67  +  2)  +  1.282  (1)  (24 .67  +  2)"* 

-  160  +6.6 

=  +  166.6  or  approximately  167  tires 

The  order  quantity  for  this  sytem  cannot  be  a  fixed  quantity  as  it  was 
for  the  continuous  review  system.  The  reorder  policy  must  be  adaptive  and 
based  upon  the  on-hand  inventory  (Z)  at  the  time  of  the  review.  The  reorder 
quantity  is  given  by:  q  ■  M-I. 

2 Richard  B.  Chase  and  Nicholas  J.  Aquilano,  Production  and  Operation  Manage¬ 
ment  (Homewood  IL:  Richard  D.  Irwin,  Inc.,  1977)  pp.  388-389. 
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The  Air  Force  EOQ  Model 

The  EOQ  model  used  in  the  Air  Force  supply  system  is  applied  to  items 
which  cannot  be  economically  repaired  by  a  field  or  depot  maintenance  acti¬ 
vity.  As  covered  in  Chapter  11,  AFM  67-1,  these  items  include  consumables 
minor  parts,  components,  tools  and  hardware.  The  model  is  derived  from  a 
total  cost  equation  given  as: 


TC  - 


CqDDR 


+  I-g-Q 


(18) 


where  DDR  -  daily  demand  rate  -  SaSS^y*  £*3  *******  - 

current  date  -  date  of  first  demand 

I  a  50%  carry  cost  rate 

CQ  *  $5  per  order 

The  resulting  EOQ  model  is 

Eoe  -  (19) 


where  VSO  is  a  variable  stockage  objective  days  weighting  factor.  The 
safety  level  quantity  is  given  by: 


SLQ  -  C  V3(0  &  STQ)  (20) 

where  c  ■  the  number  of  standard  deviations  to  be  covered  by  the  SLQ 
which  is  normally  set  equal  to  1  giving  a  84%  service  level. 

0  &  STQ  ■  the  order  and  ship  time  quantity 
Finally,  the  reorder  point  is  given  by:  ROP  -  SLQ  +  0  &  STQ  (21) 
Recoverable  Item  Model 

The  continuous  review  inventory  system  model  can  be  easily  adapted  to 
describe  the  recoverable  item  control  system  used  by  the  Air  Force.  In 
fact,  the  modification  is  only  a  redefinition  of  lead  time.  Recoverable 
itesHi  are  repaired,  not  disposed  of,  when  they  fail.  Lead  time  for  such 
a  system  is,  then,  the  resupply  time.  Resupply  time  is  defined  to  be  the 
time  from  failure  of  an  item  until  the  item  is  either  repaired  at  the 
local  level  and  returned  to  supply  or  is  replaced  by  a  one-for-one  trade 
transaction  with  the  depot  repair  activity.  The  average  resupply  time,  T, 
is  given  in  abbreviated  form  by: 


.2 


1 


* 

► 

9 

% 


< 


I 


1 


T  -  r  (Rg)  ♦  (1-r)  Rq  (22) 

where  «  average  repair  tine  at  the  base 

R  “  average  replacement  time  from  the  depot  (order  and 
ship  time) 

r  ■  probability  that  the  item  will  be  repaired  at  base  level 

1-r  «  Not  Reparable  This  Station  (NRTS)  rate. 

The  repair  cycle  quantity  (RCQ)  is  then  confuted  as  follows: 

RCQ  »  DDR  •  PER  •  T  (23) 

where  PBR  is  the  percentage  of  time  (decimal  equivalent)  that  a  given 
item  is  repaired  at  base  level.  Finally,  the  order  and  ship  time  quantity 
is  computed  as  follows: 

0  &  STQ  -  DDR  (1-PBR)  *  0  &  ST  (24) 

Concluding  Remarks 

Inventory  models  which  support  continuous  review  and  periodic  review 
systems  have  been  presented  in  this  chapter.  Most  organizations  with  a 
substantial  investment  in  inventory  find  it  economical  to  stratify  the 
items  they  stock  according  to  some  simple  classification  scheme,  such  as 
the  "A-B-C"  scheme.  Items  which  fall  into  category  A  typically  make  up  10% 
of  the  total  inventory  volume,  but  account  for  as  much  as  70%  of  total 
investment.  These  are  high  value  items  and  usually  require  a  continuous 
review  system  model  to  control  their  activity.  The  remaining  items  can  be 
controlled  adequately  with  a  periodic  review  system. 

The  models  presented  do  have  shortcomings,  chief  of  which  is  their 
limitation  to  single  item  inventories.  When  applied  to  large  multi-item 
systems,  they  do  not  address  the  interactive  nature  of  such  systems.  The 
mathematical  models  are  useful  for  gaining  some  valuable  theoretic  insights 
into  system  behavior  and  performance,  but  lack  the  ability  to  deal  with 
the  enormous  complexity  found  in  real  systems.  To  deal  with  the  complex 
multiple  item,  multiple  echelon  systems,  we  can  return  to  the  enpirical 
scientific  technique  of  experimenting  with  am  adequate  model  of  the  system. 
Such  a  model  for  experimentation  cam  be  developed  using  the  techniques  of 
computer  simulation. 
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CHAPTER  19 


QUEUING  MODELS 


INTRODUCTION 


Waiting  lines  or  queues  arise  in  numerous  situations  within  the  logis¬ 
tics  environment.  Essentially,  a  queue  forms  whenever  people  or  objects, 
arriving  at  a  service  facility,  must  wait  because  the  service  operation 
is  busy.  In  these  situations  where  demand  for  service  exceeds  the  capa¬ 
city  of  the  server  or  service  facility,  the  organization  or  mission 
Incurs  direct  or  at  least  indirect  costs  since  the  time  people  or  objects 
spend  in  waiting  is  unproductive.  However,  minimizing  the  cost  of  wait¬ 
ing  is  not  necessarily  an  easy  matter  since  increasing  the  capacity  of 
the  service  facility  to  respond  to  demand  can  lead  to  increases  in  the 
marginal  cost  of  service.  Accordingly,  a  primary  objective  in  the  anal¬ 
ysis  of  queuing  situations  is  to  achieve  a  balance  in  the  tradeoff  between 
the  cost  of  waiting  and  the  cost  of  service.  Many  analytical  models  have 
been  developed  to  describe  certain  operating  characteristics  of  the 
queuing  phenomena  and  we  therefore  useful  to  decision  makers  in  the 
analysis  of  cost  tradeoffs  and  other  aspects  of  the  queuing  system. 

This  chapter  will  survey  several  of  the  fundamental  analytical  techniques. 

The  examples  used  to  illustrate  the  quantitative  models  may  appear 
less  complex  than  many  of  the  queuing  situations  encountered  in  the 
logistics  environment  since  the  assumptions  about  the  queuing  process  being 
described  by  the  models  are  quite  restrictive.  In  reality,  queuing 
situations  can  typically  be  characterized  as  dynamic,  interactive  systems 
with  complex  probabilistic  arrival  and  servicing  patterns.  For  these 
intricate  waiting  line  situations,  computer  simulation,  which  involves 
development  of  a  simulation  model  to  artificially  reproduce  and  describe 
the  queuing  process,  is  a  viable  recourse.  However,  this  does  not  suggest 
that  the  quantitative  models  have  no  utility  for  the  decision  maker.  Their 
importance  is  recognized  in  that  model  output  provides  useful  information 
which,  in  many  instances,  adequately  approximates  the  operating  charac¬ 
teristics  of  the  queuing  process  being  described  and  which  may  be  useful 
in  the  verification  of  simulation  model  relationships. 

The  word  "described"  was  used  above,  with  consideration.  All  queuing 
technologies,  whether  applied  mathematics  or  simulation,  are  descriptive 
devices.  Their  output  is  not  an  optimal  solution  but  rather  a  prediction 
of  the  system's  plausible  future  behavior.  This  prediction  can  only 
assess  the  likelihood  that  the  system  will  be  in  any  of  its  possible  states 
at  any  particular  point  in  time;  that  is,  no  model  has  the  capability  to 
predict  any  future  state  with  certainty. 


QUEUING  SYSTEMS 


The  typical  structure  of  a  queuing  system  is  given  in  Figure  19-1. 


Figure  19-1.  Queuing  System 


System  Boundary 


Service  Facility 


un 
Arrival 
Process 


The  major  components  and  processes  of  the  queuing  system  are  described 
as  follows : 

a.  Population — the  source  of  the  units  arriving  at  the  service  faci¬ 
lity.  This  calling  source  can  consist  of  single  or  multiple  populations 
having  finite  or  infinite  number  of  units. 

b.  Unit  Arrival  Process — the  manner  by  which  units  arrive  at  the 
queuing  system. 

1.  Units  can  arrive  singularly  or  in  bulk  and  under  total,  partial 
or  no  control. 

2.  The  arrival  process  can  be  deterministic  or  stochastic  with 
dependent  or  independent  arrivals. 

c.  Queue— units  awaiting  service. 

1.  The  queue  is  described  by  its  length,  which  may  be  restricted; 
by  the  number  of  lines;  and  by  the  relationship  of  the  lines  to  the  service 
facility. 

2.  Queue  discipline  refers  to  the  behavior  of  arriving  units. 
Arriving  units  may  view  the  line  and  decide  not  to  join  (balk) ;  join  the 
line  and  then  leave  the  system  without  being  served  (renege) ;  exchange 
lines  in  multiple  queue  systems  (jockey);  collude  with  other  units  in  line; 
and  use  a  variety  of  decision  rules,  such  as  random  selection,  in  selecting 
a  line  in  multiple  queue  systems.  Arriving  units  which  randomly  select  a 
line  in  multiple  queue  systems  and  which  do  not  balk,  renege,  collude  or 
jockey,  are  referred  to  as  patient  units. 

d.  Service  Facility—that  entity  in  which  the  process  of  service  is 
embodied. 


1.  The  service  facility  may  have  none  (unit  self  service  process), 
one  or  multiple  servers  in  parallel,  series  or  both  configurations. 

2.  Multiple  servers  may  be  cooperative  or  uncooperative,  meaning 
that  idle  servers  either  assist  or  do  not  assist  busy  servers  in  the 
system. 

3.  The  time  required  to  service  one  unit  in  the  facility  may  be 
deterministic  or  stochastic. 


4.  Service  discipline  describes  the  manner  in  which  units  cure 
selected  for  processing  and  includes,  for  example,  first-come  first-served 
iFCFS) ,  last-come  first-served  (LCFS) ,  service  in  random  order  (SIBO)  and 
preemptive  and  nonpreemptive  priority  disciplines. 


Terminology  and  Notation 


a.  State  of  the  system — N(t) :  typically,  the  number  of  units  in  the 
system  at  time  t. 


b.  State  probabilities — Pn(t):  the  probability  that  the  system  will 
be  in  state  n  at  time  t;  that  is,  n  units  in  the  system  at  time  t. 


c.  Number  of  servers — c . 


d.  Mean  arrival  rate  of  units  when  n  units  are  already  in  the 
system — 


e.  Mean  service  rate  for  the  total  system  when  n  units  are  in  the 
system — yn. 


Steady  State  Notation 


Because  the  behavior  of  a  stochastic,  dynamic  system  is  difficult  to 
capture  in  a  model  consisting  only  of  mathematical  and  statistical 
variables  and  relationships,  such  efforts  always  entail  the  assumption  that 
the  stochastic  processes  generating  the  probabilistic  phenomena  can  and 
have  attained  a  condition  of  steady  state.  This  is  a  condition  of  probabi¬ 
listic  equilibrium  and  indicates  only  that  the  probabilities  of  tran¬ 
sitioning  from  one  state  to  another  are  independent  of  initial  conditions. 
Special  notation  has  been  developed  for  steady  state  conditions.  Some  of 
the  most  frequently  used  are: 


a.  State  probabilities— P^. 


b.  Expected  number  of  units  in  the  system— L  . 


c.  Expected  number  of  units  in  the  queue — L  . 


d.  Expected  number  of  units  in  the  queue  for  a  busy  system— L^. 

e.  Expected  time  in  the  system— W. 


If  ^  is  a  constant  X  for  all  n,  and  ^  is  a  constant  y  for  all  n,  and 
the  symbol  "E"  represents  the  expected  value  or  mean,  then: 

E  [number  in  system]  «  E  [number  in  queue]  +  E  [number  in  service] 


L  =  L  +  L 
s  q  se 

E  [time  in  system]  *  E  [time  in  queue]  +  E  [time  in  service] 


W  =  W  +  — 

s  q  u 


Once  any  three  of  the  seven  variables  have  been  measured  or  estimated, 
these  equations  may  be  used  to  provide  the  solution  for  the  remaining 
four  variables. 

FUNDAMENTAL  QUEUING  MODELS 


The  general  type  of  mathematical/statistical  model  most  commonly 
applied  to  queuing  problems  is  the  Markov  chain.  The  reason  for  this  is 
that  virtually  any  physical  or  logistical  process  involving  the  movement  of 
resources  or  information  can  be  visualized  as  a  ramified  network.  Such 
networks  lend  themselves  to  investigation  by  Markov  analysis.  All  of  the 
following  models  are  based  upon  a  special  Markov  process  known  as  the 
"birth-death"  process.  They  each  assume  that  inputs  are  generated  by  a 
Poisson  process  and  that  service  times  are  described  by  the  negative  expo¬ 
nential  distribution.  The  models  will  follow  the  notation  of  Budnick, 
Mojena  and  Vollman1. 

The  Basic  Models 

Assume  XR»  X  for  all  n  and  let  p  ■  —  be  the  system  utilization  factor. 

Then,  for  the  case  where  c  ■  1  (the  single  queue-single  server  M/M/1  model 
using  Kendall's  notation),  we  have: 


X  -  X 
n 


for  all  n 


n 


If  p  <  1,  then  this  model  will  eventually  attain  steady  state.  Some  impor¬ 
tant  results  for  this  model  are: 


PQ  -  1  -  P  (5) 


P(n  >  0)  -  p 

(6) 

P  m  P  0** 
n  o 

(7) 

(8) 

L  - 

q  u-i 

(9) 

h>  “  Fx 

(10) 

Bs  m£x 

(11) 

^q  =  y-T 

(12) 

wb--^x 

(13) 

For  the  case  where  c  >  1  (the  single  queue  -  multiple  server  M/M/c 
model  from  Kendall's  notation)  we  have: 

\  *  \  for  all  n 

n 

ny  for  n  c 

u  » 
n 

cu  for  n  ^  c 

The  steady  state  requirements  are  p  “  ^  <  1  and  the  model  results  are : 


P 

o 


P 

n 


£ -  . 

cicn-c 


for  0  £  n  <_  c 

P  for  n  >  c 
o  — 


(14) 


(15) 


1F.  S.  Budnick,  R.  Modena,  and  T.  E.  Vollmann,  Principles  of  Operations 
Research  for  Management  (Homewood,  II:  Richard  D.  Irwin,  Inc.,  1977),  Chap  12. 
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P(n  >  c) 


(16) 


pV  .  P 

cl(yc-X)  o 


L 
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e+i  .  p 
P  O 

(c-1) ! (c-p) 2 


(17) 


L  ■  L  +  p 
s  q 


_  _ 

"  P(n  >  c) 


«  -i*  ♦  i 

s  X  y 


W  -  t1 

q  X 


W_ 


W, 


b  P(n  >  c) 


(18) 

(19) 

(20) 

(21) 

(22) 


Finite  Queues 


If  the  number  of  units  in  the  system  cannot  exceed  some  number,  say  N, 
then  the  models  must  be  modified  accordingly.  In  these  cases  we  have: 


X  * 
n 


X 

0 


for  n  ■  0,  2,  N" 1 

for  n  >  N 


u  »  y  for  all  n 
n 

There  are  no  steady  state  requirements  since  X^  ■  0  for  some  state . 
For  the  c  «  1  case  (M/M/1  model),  the  results  are: 
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1=£ 


1-p 


N+l 


1 

N+l 


for  Xj<  y 


for  X*  y 


(23) 


L9-6 


(35) 
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+  pu-v 
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d  P(n  >  c) 
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1 
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w„ 


w. 


P(n  >  c) 


(36) 

(37) 

(38) 

(39) 

(40) 


Finite  Populations 

If  the  size  of  the  calling  population  is  finite,  say  of  size  m,  then  A 
is  now  defined  as  the  effective  arrival  rate  A(m-L  ) , u  ■  y  for  all  n, 
and  the  models  are  modified  as  follows:  s  n 


For  the  case  where  c  ■  1: 
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ml 

Cra-i) 1 
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for  n  <  m 


(41) 
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Final  Note 

It  should  be  obvious  from  the  complexity  of  the  formulas  presented  for 
these  simple  queuing  situations  that  the  applicability  of  mathematical 
techniques  is  limited  for  many  practical  problems.  When  such  customer 
behaviors  as  balking,  reneging,  jockeying  and  colluding  are  considered 
along  with  complex  structural  networks  of  interacting  queues  in  a  system, 
analytical  techniques  cannot  even  be  derived. 

Large  scale,  complex,  dynamic  systems  can  be  modeled  effectively, 
however,  through  the  use  of  simulation.  During  the  past  twenty-five  years , 
several  programming  languages  have  been  developed  which  greatly  enhance  our 
ability  to  model  and  analyze  complex  systems.  Some  examples  of  these  are 
Dynamo,  GPSS,  GASP  IV,  Q-GERT,  and  SLAM. 

EXAMPLES  OF  MATHEMATICAL  MODELS 

Single  Server  Queuing  Model  with 
Poisson  Arrivals  and  Exponential  Service 

A  base  supply  officer  is  considering  the  establishment  of  a  customer 
relations  desk  at  base  supply  to  assist  customers  with  supply  related 
problems.  The  supply  officer  estimates  that  customers  will  arrive  for 
assistance  at  the  rate  of  15  per  hour.  The  supply  clerk  being  considered 
for  the  customer  relations  job  can  service  customers  at  the  rate  of  one 
every  three  minutes.  The  supply  officer  wishes  to  determine  the  following: 

a.  Utilization  of  the  supply  clerk. 

b.  Average  number  of  customers  in  the  waiting  line  for  assistance. 

c.  Average  number  of  customers  either  waiting  for  assistance  or  in 
the  process  of  receiving  assistance. 

d.  Average  waiting  time  in  the  line. 

e.  Average  time  spent  waiting  in  line  and  receiving  assistance. 

f.  The  service  rate  necessary  to  reduce  the  average  number  of 
customers  in  the  total  system  to  two  or  less  customers. 

Assumptlons/Proper+ies  of  the  Model: 

1.  Single  server  layout  with  a  single  queue. 

2.  Single  service  phase. 

3.  Infinite  source  population. 
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4. 


Poisson  arrival  pattern. 


5.  First  come,  first  served  queue  discipline. 

6.  Exponential  service  pattern. 

7.  Unlimited  permissible  queue  length. 

Solution ; 

a.  The  average  utilization  of  the  supply  clerk  (p)  is  given  by: 


p  *  —  where, 

U 

X  =»  customer  arrival  rate  (arrivals  per  hour) , 
y  *  customer  service  rate  (services  per  hour) . 

The  stated  service  rate  of  one  customer  every  three  minutes  must  be  con¬ 
verted  to  the  proper  terms  (services  per  hour) . 

1  service  „  60  minutes 

U  =  —  —  x  i  uv... -  *  20  services  per  hour. 


3  minutes 


1  hour 


Then,  by  substitution: 


15 


p  »  —  -  .75  »  75  percent  utilization  for  the  clerk. 

b.  The  average  number  in  the  waiting  line  (Lq)  is  given  by  substitu¬ 
tions  into  equation  (9) : 


.75(15) 

20-15 


2.25  customers 


c.  The  average  number  of  customers  in  the  waiting  line  or  receiving 
assistance  (average  number  of  customers  in  the  total  system,  Ls)  is  deter¬ 
mined  by  equation  (8) : 


15 


20-15 


3  customers 


d.  The  average  waiting  time  in  the  line  (Wq)  is  given  by  equation 
(12): 


.75 

20-15 


.15  hour  or  9  minutes  per  customer 


e.  The  average  waiting  time  in  the  line  and  receiving  assistance 
(average  waiting  time  in  the  total  system,  Ws)  is  given  by  equation  (11) 


ws  - 


1 

20-15 


.20  hour  or  12  minutes  per  customer. 


f.  The  service  rate  necessary  (for  the  supply  clerk)  to  reduce  the 
average  number  of  customers  in  the  total  system  to  two  or  less  is  deter¬ 
mined  using  equation  (8) : 


15 

y-15 


<  2, 


2u  -  30  >  15, 


U  ^22.5  services  per  hour 

Multiple  Server  Queuing  Model  with 
Poisson  Arrivals  and  Exponential  Service 

In  the  direct  exchange  section  of  a  maintenance  supply  facility,  acti¬ 
vities  are  authorized  to  exchange  malfunctioning  assemblies  and  sub¬ 
assemblies  for  good  ones.  The  bad  components  are  then  forwarded  to  the 
appropriate  maintenance  shops  for  repair.  Normally,  a  mechanic  from  the 
using  activity  will  present  the  bad  assembly  to  the  clerk  in  the  direct 
exchange  section.  The  clerk  fills  out  the  required  paper  work  and 
exchanges  the  item  while  the  mechanic  waits.  Mechanics  arrive  in  a  random 
(Poisson)  pattern  at  the  rate  of  40  per  hour  and  a  clerk  can  fill  requests 
at  the  rate  of  20  per  hour  (exponential) .  If  the  cost  for  a  supply  clerk 
is  $2.00  per  hour  and  the  cost  for  a  mechanic  is  $4.50  per  hour,  determine 
the  optimum  number  of  clerks  to  staff  the  direct  exchange  section. 


Assumptions/Properties  of  the  Model: 

1.  Multiple  server  layout  with  a  single  queue. 

2.  Single  service  phase. 

3.  Infinite  source  population. 

4.  Poisson  arrival  pattern. 

5.  First  come,  first  served  queue  discipline. 

6.  Exponential  service  pattern. 

7.  Unlimited  permissible  queue  length. 


Solution: 

The  first  step  is  to  determine  the  average  number  of  mechanics  waiting 
in  line  at  the  direct  exchange  counter.  The  cost  of  this  "idle"  time  is 
then  compared  to  the  cost  of  adding  additional  clerks.  To  begin,  assume 
that  three  clerks  (c  ■  3)  will  be  utilized  because  with  a  high  arrival  rate 
in  relation  to  the  service  rate,  a  system  with  one  or  two  servers  would 


result  in  excessively  long  lines.  The  average  number  of  mechanics  in  the 
line  is  computed  by  equations  (14)  and  (17)  (note  that  p  ■  X/u  ■  40/20  *  2) 


For  an  eight  hour  day  at  $4.50  per  hour,  the  value  of  the  mechanic's  idle 
time  can  then  be  computed  as:  .888  mechanic  x  $4.50  per  hour  x  8  hours  ■» 
$31.97. 

The  next  step  is  to  recalculate  the  waiting  time  if  another  clerk  is 
added  to  the  system.  Then,  compare  the  added  cost  of  the  additional  clerk 
to  the  value  of  the  time  saved  by  the  mechanics  in  the  line.  Using 
equations  (14)  and  (17),  when  c  ■  4. 


25  (.130)  .  . 

Lq  ■  -  *  .173  mechanics 

(4-1) 1 (4-2) 2 

The  value  of  the  mechanic's  idle  time  with  four  clerks  is  as  follows: 

.173 mechanic  x  $4.50  per  hour  x  8  hours  *  $6.23.  Therefore,  the  value 
of  mechanic's  time  saved  in  the  line  is  $31.97  -  $6.23  -  $25.74.  The 
cost  of  the  additional  clerk  is  $2.00  per  hour  x  8  hours  »  $16.00.  The 
cost  reduction  obtained  by  adding  the  fourth  clerk  is  $25.74  -  $16.00  ■ 
$9.74,  and  the  addition  of  a  fourth  clerk  is  thus  advantageous. 

The  next  step  is  to  recalculate  the  waiting  time  if  a  fifth  supply 
clerk  is  added  and  compare  the  additional  cost  with  the  value  of  the  time 
saved  by  the  mechanics  in  line.  From  equations  (14)  and  (17),  it  can  be 
determined  the  ■  .0397  mechanics.  The  cost  of  idle  time  is  now  .0397 
mechanics  x  $4.50  per  hour  x  8  hours  <■  $1.43.  The  value  of  mechanic’s  time 
saved  (compared  to  the  three  clerk  system)  is  $31.97  -  $1.43  ■  $30.54.  The 
cost  of  two  additional  clerks  is  2  clerks  x  $2.00  per  hour  x  8  hours  - 
$32.00,  which  is  greater  than  the  value  of  the  time  saved.  These  results, 
therefore,  suggest  that  four  clerks  should  be  used  in  this  multiple  server 
system. 
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Slngel  Server  Queuing  Model 
with  Limited  Queue  Length 

A  NATO  country  is  requesting  USAF  airlift  support  for  a  large  scale 
movement  requirement  of  recently  procured  explosive  cargo.  It  is  proposed 
that  the  airlift  aircraft,  to  be  chartered  under  AFR  76-28  rate  tariffs, 

will  offload  the  munitions  at  a  USAF  installation  which  is  colocated  on  a 

civilian  airfield  within  the  NATO  country.  The  installation's  aerial  port 
squadron  plans  to  use  an  isolated  offload  ramp,  which  has  space  to  offload 
one  aircraft  at  a  time,  and  an  entrance  taxiway  which  has  space  for  three 
aircraft  waiting  to  be  offloaded.  The  average  arrival  rate  of  airlift 
aircraft  is  estimated  at  two  aircraft  per  hour  and  the  offload  rate  is 
estimated  at  2.5  aircraft  per  hour.  The  marginal  revenue  derived  from  the 

rate  tariffs  to  cover  certain  operating  expenses,  is  estimated  at  $500  per 

aircraft  load.  Additional  ramp  space  for  holding  aircraft  waiting  to  be 
offloaded  is  available  on  the  civilian  airfield  at  a  leased  rate  of  $250 
per  day  per  space.  The  airfield  will  be  open  14  hours  a  day  for  this 
operation. 

Assuming  Poisson  arrivals  and  exponential  service,  should  additional 
aircraft  parking  space  be  rented  on  the  civilian  airfield  to  expand  the 
restricted  queue?  If  so,  how  many  spaces?  In  addition,  what  will  be  the 
average  number  of  aircraft  in  the  queuing  system,  both  in  line  and  being 
offloaded? 

Assumptions/Properties  of  the  Model: 

1.  Single  server  layout  with,  a  single  queue. 

2.  Single  service  phase. 

3.  Infinite  source  population. 

4.  Poisson  arrival  pattern. 

5.  First  come,  first  served  queue  discipline. 

6.  Exponential  service  pattern. 

7.  Limited  permissible  queue  length. 

Solution: 

One  approach  to  solving  this  problem  is  to  assume  an  increasing  number 
of  aircraft  parking  spaces  and  compare  the  additional  revenue  generated  by 
each  additional  space.  Additional  spaces  will  be  rented  until  the  revenue 
becomes  less  them  the  $250  cost  of  leasing. 

Aircraft  will  be  offloaded  at  the  rate  of  2.5  aircraft  per  hour  any 
time  there  au:e  aircraft  in  the  queuing  system.  To  find  the  amount  of  time 
there  will  be  aircraft  in  the  system,  we  cam  solve  for  the  probability  of 
zero  in  the  system  using  quation  (23)  and  subtract  this  result  from  1.00 
in  equation  (24)  to  derive  the  percent  of  time  that  aircraft  are  being 
offloaded  (alternatively,  the  percent  of  time  that  the  system  is  busy) . 
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For  the  probability  of  zero  aircraft  in  the  system  with  four  spaces 
available  in  the  total  queuing  system: 
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1-.8 
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.297 


P(n  >  0)  ■  1  -  P  -  1  -  .297  -  .703 
o 

Therefore,  aircraft  will  be  in  the  process  of  being  offloaded  (the  system 
will  be  busy)  70.3%  of  the  time. 

When  one  space  is  rented,  N  -  4  +1,  and  equations  (23)  and  (24)  give: 
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1-.8 
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.271 


P(n  >0)  -  1  -  .271  -  .729 


In  this  case,  service  will  be  carried  out  72.9%  of  the  time,  or  an 
increase  of  2.6%  (72.9%  -  70.3%).  In  revenue,  this  is  worth  .026  x  2.5 
aircraft  per  hour  x  14  hours  x  $500  per  aircraft  or  $455.  This  is  con¬ 
siderably  more  than  the  cost  of  leasing  the  additional  space. 

When  a  second  parking  space  is  leased,  N  ■  4  +  2,  and  equations  (23) 
and  (24)  give: 


P 

o 


1-.8 


1-.8 


6+1 


.253 


P(n>0)  ■  1  -  .253  *  .747 


Thus,  with  six  total  spaces  in  the  queuing  system,  the  aerial  port  squadron 
will  be  busy  74.7%  of  the  time  offloading  aircraft.  The  additional  space 
increases  utilization  of  the  aerial  port  squadron  by  1.8%  x  2.5  aircraft 
per  hour  x  14  hours  x  $500  per  aircraft,  or  $315.  Since  this  amount  is 
greater  than  the  $250  rental  charge,  the  space  should  be  rented. 
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A  third  leased  space,  with  N  ■  4  +  3,  gives: 


1-.8 


1-.8 


7+1 


.240 


P(n  >  0)  -  1  -  .240  »  .760 


The  increased  service  utilization  is  1.3%  (76.0%  -  74.7%)  and  the  added 
revenue  is  given  by:  .013  x  2.5  aircraft  per  hour  x  14  hours  x  $500  per 
aircraft  *  $227.50.  Thus ,  the  third  space  should  not  be  rented  since  it 
would  incur  a  loss  of  $22.50  ($250  -  $227.5). 

A  reasonable  solution,  th«i,  is  to  lease  just  two  additional  spaces. 
The  revenue  is  given  by:  .747  x  2.5  aircraft  per  hour  x  14  hours  x  $500 
per  aircraft  or  $13,072.50. 

To  determine  the  average  number  of  aircraft  in  the  total  system  with 
two  additional  spaces  to  be  leased,  we  rie  equation  (26) : 

N  »  6 


L 


s 


.8 

1-.8 


(6+1) .8 


6+1 


1-.8 


6+1 


*2.14  aircraft. 
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CHAPTER  20 


RELIABILITY 


INTRODUCTION 

The  logistics  manager  is  constantly  faced  with  the  problem  of  increas 
ing  support  costs  in  an  era  of  scarce  resources.  Numerous  studies  have 
shown  that,  in  large  part,  support  costs  are  related  to  system  and  com¬ 
ponent  reliability.  For  this  reason  it  becomes  increasingly  important 
for  the  logistics  manager  to  have  an  understanding  of  the  basic  concepts 
and  techniques  of  reliability  theory.  The  following  sections  present 
some  of  the  tools  which  can  be  applied  by  the  logistics  manager  to  prob¬ 
lems  and  decisions  in  the  area  of  reliability.  In  no  way  is  this  chapter 
intended  to  be  a  complete  treatise  on  reliability  theory.  The  models 
are  presented  without  derivation  or  proof.  Numerous  examples  are  pro¬ 
vided  to  illustrate  the  application  of  the  principles  and  models  in  the 
logistics  arena. 

DEFINITIONS 


Numerous  terms  and  phrases  are  used  consistently  in  the  literature 
and  in  practice.  These  are  listed  below  along  with  their  accepted  usage. 

1.  Availability  -  A  measure  usually  stated  as  a  probability  that  a  sys¬ 
tem  will  operate  satisfactorily  when  called  upon  to  operate  under  stated 
conditions,  usually  in  an  operational  environment.  Mathematically, 
availability  is  given  as: 


Availability  » 


MTBF 

MTBF  +  MTTR 


2.  Failure  Rate  -  The  rate  at  which  failures  occur  during  a  specified 
interval  of  time,  t  to  t  +  a,  given  by: 


X  (t,a)  - 


R(t)  -  R(t  +  a) 
aR(t) 


3.  Hazard  Rate  -  Instantaneous  rate  of  failure  given  as: 


z(t)  -  lim  X  (t,a) 


where  f (t) 


the  failure  rate  distribution 


and  R(t) 


the  reliability  function  which  can  be  found 
by 


R(t) 


1  ~Jt  f(t)dt 
o 

1  -  F(t) 


4.  Mean  Tine  Between  Failures  (MTBF)  -  The  average  life  of  a  system  or 
equipment.  MTBF  is  considered  to  be  a  primary  measure  of  reliability. 

MTBF  - /"  R(t)dt. 

•'o 

5.  Mean  Time  to  Failure  (MTTF)  -  Means  the  same  as  MTBF  except  that 
MTTF  is  used  for  items  which  will  not  be  repaired. 

6.  Mean  Time  to  Repair  (MTTR)  -  The  average  time  to  repair  an  item 
including  both  scheduled  and  unscheduled  maintenance. 

7.  Median  Life  -  The  time  period  for  which  the  system/equipment 
reliability  is  0.50,  i.e.,  half  of  the  items  will  fail  after  their  median 
life  and  half  before  the  median  life. 

8.  Parallel  Design  -  Two  or  more  components  are  connected  in  such  a  way 
that  each  must  fail  in  order  for  the  entire  system  to  fail. 

Figure  20-1 


Since  this  system  has  alternate  paths  of  operation,  both  A  and  B  must 
fail  for  the  entire  system  to  fail. 

9.  Redundancy  -  Associated  with  parallel  design.  Redundancy  usually 
increases  reliability  by  providing  alternate  paths  of  operation. 

10.  Reliability  -  The  probability  that  a  system  or  equipment  will  operate 
over  a  stated  period  of  time  for  its  intended  purpose  under  stated  con¬ 
ditions. 

11.  Reliability  Growth  -  Improvement  (increase)  in  an  item's  reliability 
after  the  "bum-in"  period  due  to  the  conscious  efforts  of  engineers  and 
management  usually  requiring  an  expenditure  of  resources. 


12 .  Reliability  Improvement  Warranty  (RIW)  -  A  contractual  technique 
whereby  the  contractor  is  provided  with  incentives  to  reduce  system 
support  costs  through  reliability  improvements  over  a  stated  period  of 
time. 

13.  Series  Design  -  System/equipment  design  configuration  in  which  all 
components  must  operate  for  the  system  to  operate. 

Figure  20-2 


If  component  A  or  B  fails,  the  system  fails. 

BASIC  CONCEPTS 

Reliability  is  concerned  with  system/component  failure,  these 
failures  have  been  categorized  into  three  types  which  are  illustrated  in 
the  figure  below. 


Figure  20-3 


Decreasing  Constant  failure  Increasing 

failure  rate  failure 

rate  rate 


1.  Early  failure  -  occurs  during  system  "burn-in"  or  debugging  phase 
these  are  a  result  of  the  manufacturing  or  assembling  process  when  the 
equipment  and/or  techniques  are  new  or  highly  complex. 

2.  Wearout  failure  -  Usually  a  result  of  the  aging  process  and  when 
items  are  not  properly  maintained  or  parts  replaced  when  required. 

3.  Chance  failure  -  Occurs  randomly  throughout  the  item's  useful  life 
and  is  the  most  difficult  to  reduce  or  eliminate. 


DETERMINING  RELIABILITY 

the  theory  and  practice  of  reliability  is  closely  associated  with 
the  application  of  statistical  probability.  In  reliability,  only  two 
outcomes  are  possible— success  or  failure.  Either  a  system  will  operate. 


or  it  ww.'c.  These  outcomes  are  then  mutually  exclusive  since  both  cannot 
occur  at  any  one  point  in  time.  It  is  normally  the  best  policy  to  define 
failure  and  let  all  other  possibilities  be  defined  as  success. 

After  the  exact  definition  of  failure  has  been  specified,  the  fre¬ 
quency  at  which  failures  occur  then  becomes  a  parameter  for  the  statis¬ 
tical  formulation  of  reliability.  This  parameter  is  called  a  failure 
rate  which  is  expressed  as  the  number  of  failures  per  period  of  time . 

Its  reciprocal  is  the  mean  time  between  failures  (MTBF)  defined  in  the 
previous  section. 

Failure  rates  are  determined  experimentally  by  testing  the  systems 
under  study  or  by  observing  their  performance  over  a  period  of  time.  It 
is  usually  best  to  obtain  a  large  number  of  trials  before  establishing  firm 
statements  about  reliability.  For  example,  we  would  not  hurriedly 
assume  that  a  coin  is  biased  if  three  heads  and  seven  tails  appeared  in 
ten  tosses.  In  this  respect,  we  should  not  automatically  jump  to  con¬ 
clusions  about  a  system’s  reliability  based  on  a  relatively  short 
experiential  period. 

In  determining  reliability,  probability  theory  plays  an  important 
role.  Analytically,  the  reliability  of  a  system  at  a  given  point  in 
time,  t,  is  expressed  as: 

Rft1)  -  1  -  Fft1) 

The  interpretation  is  in  probabilistic  terms.  Thus,  Rtt1)  is  the 
probability  that  a  failure  occurs  after  time  t*.  Conversely,  Rlt1)  is 
the  probability  that  the  system  will  not  fail  before  t  .  Graphically, 
this  principle  idea  is  shown  below. 

Figure  20-4 


The  amount  of  time  to  failure  is  defined  as  a  probability  density 
function,  f(t),  where 


f(t) 


dF  (t) 
dt 


Without  proof,  it  can  be  stated  that  reliability,  R(t) ,  is  represented 
by  an  area  under  the  time  to  failure  density  curve,  i.e . ,  a  probability. 
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Therefore,  0  <  R(t)/<  1 

Once  the  tine  to  failure  distribution  is  known,  i.e.,  its  probability 
density  function,  the  reliability  of  the  system  can  be  readily  determined. 


Since 

f(t) 

dF  (t) 

'/  * 

and 

R(t) 

•j  1  -  F  (t) 

then 

R(t) 

-  1  f  (t)dt 

In  other  words,  the  reliability  of  a  system  or  the  probability  that 
it  will  operate  successfully  up  to  time  t  is  found  by  integrating  the 
density  function  from  o  to  t  and  subtracting  the  result  from  one. 

Obviously,  finding  the  reliability  of  today's  technologically  complex 
systems  is  not  so  straightforward  and  simple.  Yet,  this  basic  principle 
applies.  Let  us  proceed  to  demonstrate  the  calculation  of  reliability 
under  several  different  circumstances. 

When  system  design  and  the  reliability  of  components  are  known,  sys¬ 
tem  reliability  can  be  obtained  by  directly  applying  the  laws  of  proba¬ 
bility  and  Boolean  algebra. 


Figure  20-5 

Components  in  Parallel 


n  « 


Ub 


Ra  (reliability  of  component  a)  «  ■  0.7 

R(S)  ■  R^  U  R^  »  reliability  of  the  system 

*  Ra  +  *b  -  Ra  *  *b 

-  0.7  +  0.7  -  [(0.7)  (0.7)] 

-  1.4  -  .49  -  0.91 

For  components  in  parallel,  the  probability  rule  for  the  union  of  events 
applies.  Further,  in  all  cases,  the  reliability  for  each  component  is 
taken  to  be  independent  of  all  others. 
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Figure  20-6 


Components  in  Series 


Ra  "  *b  "  °*7 

R(S)  -  Ra  0  "  Ra  *  \ 

-  (0.7)  (0.7) 

-  0.49 

The  multiplication  rule  applies  since  a  and  b  must  operate  or  the 
system  fails.  This  implies  that  no  matter  how  high  component  reliability 
may  be,  system  reliability  decreases  as  more  components  are  added  in 
series . 

Hie  following  example  illustrates  a  combination  of  parallel  and 
series  designs,  but  uses  the  same  probability  rules  in  determining  system 
reliability. 


Figure  20—7 


The  reliability  of  each  component  is: 


R 

a 

m 

.92 

Rf  " 

.96 

*b 

m 

.97 

R 

9 

.93 

R 

c 

m 

.98 

*h  = 

.98 

Rd 

m 

.98 

Ri  " 

.93 

R 

m 

.98 

e 


To  find  the  reliability  of  the  system,  the  following  expression  is 
solved:  (Capital  letters  are  used  for  the  components) 

R(S)  =  {[(AflB)  U  (cnD)]flE}  U  (Ffl  [(GflH)  U  i]} 


To  simplify  the  problem,  additional  components  are  defined  and  substituted 
into  the  original  expression. 


Let  R  « 

(AflB)  = 

(.92) (.97)  - 

.8924 

S  » 

(CflD)  » 

(.98)2 

.9604 

T  = 

(G  f|H)  - 

(.93) (.98)  - 

.9114 

The  original  expression  becomes : 

r(s)  -  [<Rus)nE]ul>n(Tui)] 

■  [(R  +  S  -  R  •  S)  •  E]  U[F  .  (T  +  I  -  T  •  I)] 

-  [(.8924  +  .9604  -  .8924  •  .9604) ( .98) ] U 

[.96  *  (.9114  +  .93  -  .9114  •  .93)] 


Additionally,  if  we  let 


X  ■  [(R  +  S  -  R  •  S)  *  E] 
Y  -  [F  •  (T  +  I  -  T  •  I)  ] 


we  need  only  to  solve  for 


R(S)  «*  XU  V 

-  X  +  Y  -  X  •  Y 

-  .97582  +  .95404  -  .97582  *  .95404  -  .99888 

System  or  component  reliability  is  also  determined  through  the  use 
of  reliability  models.  Three  basic  models  are  called:  constant  hazard, 
linear  increasing  hazard,  and  Weibull. 

Constant  Haz ard  Model.  The  constant  hazard  model  has  already  been 
introduced  in  Chapter  7  as  the  exponential  distribution,  sometimes 
referred  to  as  the  negative  exponential.  It  is  characterized  by  a  curve 
which  is  downward  sloping  to  the  right.  This  model  is  often  used  to 
determine  the  reliability  of  components,  particularly  electronic  components, 
after  the  bum-in  period,  but  prior  to  the  wearout  phase  of  the  life  cycle. 
During  this  period,  failures  are  considered  to  occur  randomly  (by  chance) 
rather  than  as  the  result  of  a  specific  failure  mechanism.  Within  this 
period  of  operating  life,  the  reliability  (or  probability  of  survival) 
is  the  same  for  all  periods  of  equal  length.  The  number  of  failures 
occurring  during  any  time  interval  appears  to  be  related  only  to  the  total 
number  of  units  in  operation  and  the  period  of  time  in  the  interval.  For 
each  period,  the  operating  time  begins  at  t  ■  0.  The  component  or  equip¬ 
ment  is  considered  "new"  at  this  time  in  that  it  has  survived  previous 
missions  or  operating  periods. 

Parameters  X  *  : 

Hazard  function:  z(t) 

Density  function  of  time  to  failure :  f (t) 

Cumulative  distribution  function 

of  time  to  failure:  F(t) 

Reliability  function:  R(t) 

Mean  time  to  failure  or  mean  time 

between  failures :  MTTR 

The  figures  below  depict  the  exponential  hazard  ai 

Figure  20-8 


failures  per  time  period 


Xe 


-Xt 


1  -  e 
-Xt 


-Xt 


_1_ 

X 


id  density  functions. 


z(t) 


*  b 


Constant  hazard 


T 


Note  that  the  rate  of  failure,  X,  is  the  same  for  all  t. 

Figure  20-9 


It  should  be  noted  that  when  the  MTBF  (or  MTTF)  is  equal  to  t,  the 
reliability  is  0.368  as  shown  belcw. 

Figure  20-10 


It  should  also  be  noted  that  given  a  stated  reliability  over  a  time 
interval,  mean  life  (MTBF  or  MTTR)  can  be  easily  found  by: 

MTBF  t)  ’  'iog*Rit)~ 

For  example,  if  the  reliability  of  a  component  for  a  10-hour  mission 
is  .904,  what  is  the  component's  mean  life? 


MTBF  “ 

-t 

log  R(t) 

MTBF  ■ 

-10 

log  0.904 
’e 

MTBF  « 

99.08  hours 
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At  this  point  a  work  of  caution  needs  to  be  stated.  It  is  important 
that  the  distribution  of  failures  be  known  prior  to  making  any  state¬ 
ments  about  system/ component  reliability.  In  too  many  cases,  the 
analyst  elects  to  apply  the  constant  hazard  model  because  it  is  very 
easy  to  use.  This  may  lead  to  incorrect  assumptions  about  reliability 
which  can  result  in  failed  missions  or  excessive  costs  for  system  s import 
or  both .  in  this  respect,  many  components/systems  have  been  noted  to 
have  failure  distributions  which  have  changed  over  time  as  shewn  in  the 
figure  below. 


Figure  20-11 


In  this  case,  a  constant  hazard  model  could  be  used  for  the  first 
2000  hours  of  operation  after  which  a  linear  increasing  hazard  model 
might  be  applicable. 

When  the  individual  is  quite  sure  that  an  exponential  model  is 
applicable,  reliability  can  be  determined  from  raw  data  from  a  test  as 
follows . 

EXAMPLE  20-1 


Twenty  items  are  put  on  test.  Fad  lures  occurred  at  the  following 
times  given  in  hours:  20,  27,  30,  41,  42,  48,  60,  66,  80,  81,  83,  86, 

105,  108,  and  130.  Five  items  survived  past  130 
hours. 


First,  compute  an  estimate  of  the  rate  of  failure  by: 


MTBF 


_1_ 

/N 

X 


total  time  for 
failed  items 


time  for  non- 
fed,  led  items 


number  of  failures 


MTBF 


1007  +  650 
15 

110.47  hours 

1 

X 


MTBF 

1 

110.47 

0.00905 


The  reliability  function  isj 


R(t) 


R(t) 


t 

110.47 

e 


-  0. 00905t 
e 


The  reliability  of  the  item  for  a  10-hour  mission,  then  is: 


RC10) 


10 

110.47 

e 


-  0.9135 


Llneeur  Increasing  Hazard  Model.  The  distribution  for  this  model  is 
called  the  Rayleigh  distribution.  This  distribution  is  usually  most 
applicable  when  sy stems /components  begin  to  wear  out. 
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Figure  20-13 


Rayleigh  density  function 


EXAMPLE  20-2 

Failure  data  collected  on  the  Guidance  and  Control  Unit  of  a  missile 
indicate  that  the  unit  failures  follow  a  linear  increasing  model  with 
a  «=  .0022.  What  is  the  unit's  MTBF  and  reliability  for  a  mission  time 
of  12  hours? 


MTBF 


f 


3.1416 


(2) (.0022) 

■  26.72  hours 

To  find  the  reliability  for  a  12-hour  mission: 

_  (.0022)  (12) 2 

R(12)  -  e  2 


0.8535 


Weibull.  The  most  complex  of  the  distributions  used  in  reliability,  the 
Weibull  distribution  has  great  flexibility  and  is  used  widely  in  assess¬ 
ing  failure  distributions  of  semi-conductors,  mechanical  devices,  and 
component  unit  levels. 

Parameters:  a  ■  scaling  parameter 
3  =  shaping  parameter 

Y  *  location  parameter 

The  Weibull  is  actually  a  family  of  distributions  depending  upon  the 
values  of  its  parameters.  For  example,  when  6  ■  1  and  y  ■>  0,  the  distri¬ 
bution  is  exponential  and  a  is  the  MTBF,  i.e.,  in  the  constant  hazard 

model.  At  the  same  time  for  6  <  1  the  hazard  rate  is  a  decreasing 
function,  and  for  8  >  1  the  hazard  rate  increases  with  time.  When 
a  »  1  and  y  -  0,  the  distribution  is  near  normal  when  8  M  3.2.  In 
reliability  applications,  the  Weibull  is  generally  used  with  only  two 
parameters  a  and  8,  with  y  -  0.  The  values  for  the  parameters  can  be 
obtained  with  reasonable  accuracy  by  plotting  failure  data  on  Weibull 
probability  paper.  The  reader  should  review  more  advanced  texts  in 
reliability  theory  for  a  complete  explanation.  The  figures  below  illus¬ 
trate  various  shapes  of  the  Weibull  with  different  values  of  a  and  8. 


Hazard  function: 


z(t) 


Density  if1  inction  of  time 
to  failure: 


f(t) 


Cumulative  distribution 
function 


Reliability  function :  — 

R(t)  -  1  -  F (t)  R (t)  -  e  a 


Mean  time  to  fad  lure:  i 

«>  6  /  X 

MTTF  •fQn  R(t)  at  MTTF  -  a  T  (y  +  lj 

* 

where  T  Cn  +  1)  *  n  r  (n) 

or  T  (n)  ■  n-1  T  (n-1) 

*See  Selby,  S.  M.  (ed) ,  CRC  Standard  Mathematical  Tables.  Cleveland, 
The  Chemical  Rubber  Company,  1975. 


Figure  20-14 

Wfeibull  distributions  with  Different  Values  of  a;  6-2;  y  ■  0 


Figure  20-15 

Weibull  Distributions  with  Different  Values  of  6;  a  »  1,  y  =  0 


The  density  and  reliability  functions  are  somewhat  different  when 
the  third  parameter  y  is  added.  This  location  parameter  is  used  only 
when  there  is  a  substantial  time  period  during  which  no  failures  are 
likely  to  occur.  For  discussions  of  the  three  parameter  Weibull  distri¬ 
bution  ,  texts  on  reliability  mathematics  should  be  consulted. 

EXAMPLE  20-3 


Failures  of  tires  on  a  particular  ear craft  follow  a  distribution 
pattern  which  is  closely  approximated  by  a  two-parameter  Weibull  model 
with  a  *  262.79  and  8  ■  1.13.  This  distribution  was  determined  accord¬ 
ing  to  the  following  relationship:  2  take-offs  +  2  landings  *  1  "time 
unit."  What  is  the  me  am  time  to  failure  for  this  type  of  tire?  What  is 
the  reliability  of  the  tire  if  its  mission  length  is  set  at  14  take-offs 
and  14  lamdings? 


MTTF  -  262 


rn  (-+— 

.79  ■*"*  r\1.13 


+  1 


262. 79* 885  T  (1.885) 


-  (138.469) (.95673) 

-  132.48 


The  MTTF  must  be  converted  to  "Time  Units"  which  would  be  2  x  132.48 
«  265  take-offs  and  lamdings. 

* 

T (1.885)  interpolated  from  table  on  p.  533,  CRC  Standard  Mathematical 
Tables ,  1975. 

To  find  the  reliability  of  14  tadce-offs  and  lamdings,  the  conversion 
would  make  this  equivalent  to  -=j-  or  7  time  units. 


-  7 


1.13 


R(t) 


262.79 


0.9663 


To  illustrate  the  Weibull  as  a  constant  hazard  model  (8  ■  1) ,  the 
previous  example  for  the  constant  hazard  model  will  be  reiterated. 
Recall  that  X  in  the  example  was  found  to  be  0.00905.  In  the  Weibull 
model,  a  *  X. 


110.497 


(i*1) 


-  (110.497) (1) 

=  110.497  hours 

To  find  the  reliability  for  a  10 -hour  mission: 

ID1 

"  110.497 


0.9135 


The  similarity  (and  flexibility  of  the  Weibull  model)  can  thus  be 
readily  seen.  The  next  section  demonstrates  a  technique  which  uses  raw 
failure  data  in  identifying  the  best  reliability  model. 

Identifying  the  model.  Ten  electronic  components  have  been  placed  on 
test  for  a  continuous  period  until  all  ten  failed.  Hie  time  of  failure 
of  each  unit  was  recorded  in  Table  20-1. 


Table  20-1 


Failure  Data 


Failure 

Total  Operating 
Time 

1 

8 

2 

20 

3 

34 

4 

46 

5 

63 

6 

86 

7 

111 

8 

141 

9 

186 

10 

266 

The  problem  is  to  find  the  reliability  of  the  component,  in  order  to  do 
so,  the  model  of  best  fit  must  be  determined.  A  logical  place  to  start 
is  to  plot  the  failure  data  to  determine  whether  time  to  failure  follows 
a  known  distribution.  The  density  function  of  time  to  failure  can  be 
defined  as: 


f  (t) 


N 


At. 


where. 


n )  *  number  of  surviving  items  at  the  beginning 
of  interval  t^ 


N 

At. 


total  number  of  items  placed  on  test 


length  of  interval,  i.e.,  amount  of  time 
elapsed  since  end  of  last  interval 


Thus,  for  the  first  failure 


f(t) 


fro.,-  ?] 

10 


8 


100 


1.25 


To  simplify  plotting  the  function,  100-hour  intervals  for  t  have  been 
established.  For  failure  number  two: 


J9  ~  8] 


f(t) 


10 


12 


•  100  »  0.84 


Table  20-2  shows  the  values  computed  for  each  failure.  The  plot  of 
f(t)  is  given  in  Figure  20-16. 


Next,  the  hazard  function  is  plotted  as  shown  in  Figure  20-17.  The 
hazard  function  is  defined  as  s 


[ntt^  -  -  Ati)] 


z(t) 


n(t±) 


At. 


2 


w 


For  failure  1: 


fro  - 

z(t)  -  - ^ -  •  100  -  1.25 

Note  that  t  is  also  presented  in  100-hour  increments  to  simplify 
plotting. 

For  failure  2: 

.  I2_-  .8],  , 

z  It)  -  - -  •  100  =  0.93 


The  values  computed  for  z(t)  for  all  failures  is  given  in  Table 

20-2. 


By  inspecting  Figures  20-16  and  20-17,  it  can  be  seen  that  time  to 
failure  appears  to  approach  a  constant  hazard  model,  i.e.,  exponential 
distribution.  Reliability  for  this  component  can  then  be  ascertained 
as  described  earlier  for  the  constant  hazard  model.  Reliability  from 
the  raw  data  can  be  computed  directly  for  intervals  at  any  of  the 
failure  times  as  follows: 


n(t^) 

N 


where  n(t^) 


number  of  survivors  up  to  and  including  the 
1th  failure 


N  ■  total  number  of  items  placed  on  test 


Table  20-2 


Failure  i 

Operating 

Time 

t 

f(t) 

Z(t) 

1 

8 

8 

1.25 

1.25 

2 

20 

12 

0.84 

0.93 

3 

34 

14 

0.72 

0.96 

4 

46 

12 

0.84 

1.19 

5 

63 

17 

0.59 

0.98 

6 

86 

23 

0.44 

0.87 

7 

111 

25 

0.40 

1.00 

8 

141 

30 

0.33 

1.11 

9 

186 

45 

0.22 

1.11 

10 

266 

80 

0.13 

1.25 

20-19 


100 


200 


t  (hours) 


allure,  t. 


46  hours 


0.60 


for  a  46-hour  period  is  0.60. 
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At  this  point  a  note  of  extreme  caution  must  be  made.  Reliability 
is  a  function  of  the  number  of  failures,  which  -makes  the  sample  size  of 
of  paramount  importance.  When  the  sample  size,  i.e.,  the  number  of 
failures,  is  small,  the  analyst  runs  the  same  risk  in  predicting 
reliability  that  is  usually  encountered  in  other  forms  of  statistical 
estimation.  At  the  same  time,  the  costs  in  dollars  and  time  necessary 
to  achieve  an  adequate  amount  of  failure  data,  is  almost  always  more 
than  what  is  available  The  manager,  theh,  is  forced  to  make  a  critical 
trade-off  decision  in  this  area. 

Continuing  with  the  example,  when  a  large  number  of  items  is  placed 
on  test,  the  previous  procedure  may  be  cumbersome .  An  alternate  method 
exists  for  using  a  class  interval  structure  to  analyze  failure  data  for 
identifying  an  appropriate  reliability  model. 

In  Table  20-3,  the  number  of  hours  for  the  testing  period  is  divided 
into  1,000-hour  intervals  along  with  the  number  of  failures  occurring 
during  each  interval. 


Table  20-3 


Hours 

Ati 

Humber  of 
Failures 

f(t) 

z(t) 

0-1000 

59 

3.43 

3.43 

1001-2000 

24 

1.40 

2.12 

2001-3000 

29 

1.69 

3.26 

3001-4000 

30 

1.74 

5.00 

4001-5000 

17 

0.99 

5.69 

5001-6000 

13 

0.76 

10.00 

172 

Again,  the  test  is  run  until  all  items  fail.  The  same  formulation 
is  used  as  with  the  individual  failure  items  except  that 

t^  »  time  at  the  beginning  of  the  class  interval 

At^  *  length  of  the  class  interval 

As  usual  with  grouped  data,  failures  are  assumed  to  occur  evenly 
throughout  each  class  interval. 


The  failure  density  for  class  number  3  is  found  by: 


f(t) 


[89  -  60 1 
172 
1000 


1.69 


© 


4 

The  resulting  density  is  multiplied  by  10  to  simplify  graphing  the  data. 
The  hazard  function  for  the  same  class  interval  is  found  by: 


z(t) 


29 

89 

1000 


3.26 


When  a  large  amount  of  failure  data  is  available  or  planned;  an 
appropriate  class  interval  can  be  constructed  using  Sturges'  rule. 

k  =  1+3.3  log  n 


where  k  =  number  of  classes  rounded  to  nearest  integer 

n  =  total  number  of  failures  observed. 

Although  it  is  not  necessary,  it  is  usually  best  to  make  class  intervals 
of  equal  width. 

As  a  final  note  in  this  area,  the  use  of  grouped  data  has  a  dis¬ 
advantage  in  that  the  exact  time  elapsed  to  each  failure  is  lost  within 
the  class  intervals.  This  loss  of  individual  failure  data  items  may 
affect  the  precision  of  the  reliability  estimates,  but  will  usually  not 
hinder  the  identification  of  the  appropriate  reliability  model. 

Curve  Fitting.  In  many  instances,  inspecting  a  plot  of  time  to  failure 
data  will  not  provide  an  identifiable  probability  distribution.  Several 
curve-fitting  techniques  sure  available  which  will  define  a  specific 
distribution  for  a  given  set  of  failure  data.  In  addition  to  these 
curve-fitting  techniques  which  are  not  discussed  in  detail  here,  statis¬ 
tical  goodness-of-fit  tests  can  be  easily  applied  which  give  the  manager 
reasonable  assurance  that  a  set  of  data  follows  a  stated  distribution. 
The  Chi-square  goodness-of-fit  test  as  well  as  the  Kolmogorov- Smirnov 
test  is  described  in  detail  in  Chapter  22. 

SYSTEM  RELIABILITY 

As  stated  earlier,  system  reliability  depends  upon  the  reliability 
of  each  of  the  components  as  well  as  their  configuration.  A  previous 
example  has  shown  that  a  system  can  be  more  reliable  than  its  weakest 
(least  reliable)  component.  Although  this  concept  may  be  contrary  to 
the  old  adage  that  a  "chain  is  only  as  strong  as  its  weakest  link,"  it 
is  a  vital  concept  in  understanding  the  advantages  and  disadvantages  of 
the  various  configuration  possibilities  in  system  design.  For  example, 
a  parallel  configuration  yields  a  higher  system  reliability  than  a 
series  configuration  if  the  same  components  sure  used  in  each  design. 


20-22 


In  the  final  analysis ,  then,  system  reliability  can  be  said  to  be 
related  to  the  following  three  basic  areas: 

1.  System  design  configuration 

2.  Component  reliability 

3.  Rate  of  failure 

Methodologies  for  determining  system  reliability  based  on  the  three 
areas  are  presented  in  the  sections  which  follow. 

Series  Configuration.  For  the  constant  hazard  model  (exponential) 
system  reliability  is  found  by: 

n  -At 

R(t)  =  He1"  exp  - 
i=l 

where  n  =  number  of  components  in  series 
=  failure  rate  of  component  i 

In  using  this  formulation,  the  following  assumptions  are  made: 

1.  Components  are  connected  in  series 

2.  Components  are  independent,  i.e.,  failure  of  one  has  no  effect 
on  another  component. 


n 

l  K 

i=l  ' 


EXAMPLE  20-4 


An  electronic  device  has  three  components  with  equal  reliability  as 
illustrated  below. 


Figure  20-18 


For  t  s  10  and  X.  =  .0054,  find  the  system  reliability  and  mean  time  to 
failure. 

„  -[.0054  +  .0054  +  .0054^  CIO) 

R  ■*  e 
s 

- (.0162)  (10)  „ 

=  e 


=  0.85 


Note  that  the  system  reliability  can  also  be  found  by  using  the 
probability  law  of  intersection,  namely  that 

r  =  r  n  r.  n  r 

s  a  d  c 


or  R 

s 


*  R 

c 


=  ( .95) 3 


«  0.85* 

* 

Third  decimal  place  will  differ  due  to  rounding  of  X  and  R^ . 
MTTF  is  found  by 


MTTF 


1 


.0054  +  .0054  +  .0054 
61.7  hours 


For  a  linear  increasing  hazard  model,  system  reliability  can  be 
found  by : 


n 

IT  e 
i=l 


V 
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System  reliability  for  this  design  is: 

/-  1 

2  (l  -  e-0054(10>) 
i-1 


R  CIO}  »  1- 
s 


1  -  (.0257) 


0.9998 


Using  the  probability  law  for  union,  the  same  reliability  is  found  by: 

R  Ct)  =  R  U  R.  U  R„ 
s  a  t>  c 

R  (t)  »  R  +  R  +  R  -  R  *  R.  -  R.  *  R  -  R  *  R  + 

s  abc  abbc  ac 

R  •  R.  *  R 
abc 

-  3( .95)  -  3 (.95) 2  +  (.95) 3 

«  0.9998* 


*  Fifth  decimal  place  will  be  different  due  to  rounding  of  X  and  R. 


Also  using  the  law  of  union,  the  WPTTF  is  found  by: 

1  1 


1  .  1  .  1 
KTTF  *  t —  +  t —  + 


X  X.  X  X  +  X. 
a  b  c  a  b 


\  +  X 

b  c 


X  +  X 

a  c 


X  +  X.  +  X 
a  b  c 


'(.0054)"  3  (.0054  +  .0054)  +  (  3(. 0054)) 


339.51  hours 
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System  reliability  can  be  found  for  the  Weibull  model  in  a  similar 
manner.  If  p  components  have  a  constant  hazard  -model  and  n  -  p  compo¬ 
nents  have  a  linear  increasing  hazard,  system  reliability  c an  be  readily 
found  by: 


R  (t) 
s 


UL=p+l 


e 


Parallel  Configuration.  For  components  connected  in  parallel,  reliabil¬ 
ity  for  the  system  is  determined  in  the  following  manner. 

Recall  from  probability  theory  that 

P  (failure)  =  1  -  P  (success) 

and  R(t)  =  1  -  F(t) 

For  the  constant  hazard  model,  then,  system  reliability  is: 


R  (t)  *  1  - 

s 


n 


n  (i  - 

i=l 


-X .  t) 
e  i 


The  mean  time  to  failure  is  found  by  using  the  union  law  for  probability. 
EXAMPLE  20-5 


Using  the  same  three  components  as  in  the  example  for  the  system  in 
series,  the  following  parallel  design  results: 

Figure  20-19 


MTTF  can  also  be  found  by: 


*  339.51  hours 

* 

Only  true  for  parallel  components  and  when  component  reliability  is 
equal  for  all  components. 

From  the  example  above, it  can  readily  be  seen  that  both  MTTF  and 
system  reliability  have  increased  considerably  over  that  which  was 
achieved  in  the  series  configuration.  This  i3  due  to  redundancy  which 
exists  in  the  parallel  design.  The  system  designer  must  weigh  the 
alternatives,  i.e.,  the  additional  cost  and  weight  added  to  the  system 
with  redundant  components  against  the  increased  reliability  MTTF. 

In  summary,  this  chapter  has  only  included  a  brief  introduction  to 
the  basic  concepts  of  reliability.  Other  topics  such  as  reliability 
allocation,  switching,  and  reliability  growth  are  contained  in  publi¬ 
cations  devoted  to  a  more  complete  treatment  of  the  subject.  The  user 
should  consult  these  prior  to  attempting  to  solve  problems  which  are 
other  than  the  most  basic. 


CHAPTER  21 


ECONOMIC  CONCEPTS 

21-1  PRESENT  VALUE  CONCEPTS 


A  dollar  In  hand  today  is  worth  more  than  a  claim  to  a  dollar 
tomorrow — that  is  the  essence  of  the  time  value  of  money.  This  state¬ 
ment  implies  that  money  has  the  power  to  earn  money.  Its  earning  power 
is  related  to  the  productiveness  of  capital  investment  it  makes  possible 
and  the  impatience  on  the  part  of  those  who  would  rather  borrow  to  spend 
today  them  to  save  and  spend  tomorrow.  Earning  power  is  expressed  as  a 
rate  of  interest.  Thus,  money  deposited  in  a  bank  savings  account  is 
said  to  earn  interest  at  some  stipulated  rate  per  year  or  other  time 
period. 

In  illustration,  a  bank  deposit,  P,  earns  interest  at  some  rate,  i, 
per  interest  period.  At  the  end  of  one  interest  period,  it  will  grow 
to  some  amount.  A,  according  to  the  following  formula: 

Q)  P  (1+i)  =  A 

For  any  number  of  interest  periods,  n,  P  will  grow  to  A  according  to  the 
following  formula: 

(2)  P  (1+I)n  =>  A, 

the  term  in  the  parenthesis  being  referred  to  as  the  compound  amount 
factor,  or  simply,  CAF.  We  can  express  (2)  alternatively  as: 

(3)  P  =»  A  --1  ■  ' 

(1+i) 

where  P  is  shown  as  the  present  value  of  some  future  amount.  A,  and  the 

term  — - -  ,  is  referred  to  as  the  present  value  factor,  or  simply, 

(1+i) 

PVF.  Thus,  if  an  individual  wished  to  accumulate  some  amount  a  at  the 
end  of  n  interest  periods,  given  that  the  interest  rate  were  i,  it 
would  be  a  simple  matter  to  calculate  the  amount,  P,  necessary  to 
achieve  that  goal.  That  is,  P  is  the  present  value  of  the  future 
amount,  A. 

Such  a  limp  sum  is  in  someways  similar  to  a  direct  investment 
in  productive  capital  from  which  flow  goods  or  services  with 
a  value  that  exceeds  the  amount  of  the  investment.  The  difference 
between  the  value  of  output  and  the  investment  cost  of  the  capital  is 


the  total  return  on  the  capital.  Ordinarily,  a  capital  asset  is 
expected  to  produce  output  reasonably  steadily  over  its  lifetime  gen¬ 
erating  a  flow  of  revenue  sufficient  to  provide  a  complete  recapture  of 
its  invested  cost  and  a  return  thereon.  Thus,  the  revenue  that  is 
realized  periodically  from  produced  output  is  partially  a  return  of 
capital  and  partially  a  return  on  capital.  Given  an  interest  rate  or 
rate  of  return,  i,  that  is  desired  for  some  investment,  P,  over  its 
life,  n,  one  can  determine  the  periodic  and  uniform  amount,  R,  necessary 
to  satisfy  those  conditions.  We  can  express  P  as  the  present  value  of 
the  future  stream  of  revenue  according  to  the  following  formula: 


1 

(1+i) 


1 

(1+i)' 


1 

(1+i) 


1 

(l+i)n 


Since  R  is  uniform,  (4)  cam  be  expressed  as: 


(5)  P  =  R 


f  (l+i)n-l  1 

[_  i  (l+i)n  J 


the  term  in  brackets  being  referred  to  as  the  series  present  value 
factor,  or  simply,  SPVF.  Alternatively,  we  can  express  R  as  the  amount 
of  revenue  recurring  periodically  for  some  investment,  P,  over  its  life, 
n,  with  a  rate  of  return,  i. 


R  =  P 


f  i(l+i)n  1 
[  (l+i)n-l  J 


The  term  in  brackets  being  referred  to  as  the  capital  recovery  factor, 
or  simply,  CRF. 

If  from  (4)  we  can  calculate  the  present  value  of  a  future  revenue 
streaun,  we  cam  also  use  the  formula  to  calculate  the  present  value  of  a 
stream  of  periodic  costs,  C.  Thus, 


P  =  C 


1 

(1+i) 


1 

(1+i)' 


1 

(1+i ) n 


Accordingly,  for  a  nonrevenue  producing  public  program,  we  aure  enabled 
to  calculate  the  present  value  of  all  costs  by  (7),  providing  we  know 
the  interest  rate  or  rate  of  return,  i,  to  use.  For  a  given  rate  of 
return,  we  can  calculate  the  present  value  of  costs  for  each  of  a  number 
of  competing  alternatives  designed  to  satisfy  fully  stated  program 
objectives. 


The  suggestion  of  applying  the  concept  of  the  time  value  of  money  to 
government  programs  may  seem  farfetched  indeed,  and  yet  it  has  the  very 
same  significance  to  money  spent  for  government  purposes  as  it  does  to 
private  enterprise.  Whatever  money  is  spent  for  government  purposes  has 
alternative  private  uses.  Those  alternative  uses  for  money  would  produce 
an  investment  return  or  would  satisfy  someone's  impatience  to  consume 
now.  It  is  reasonable  to  expect  government  purposes  to  achieve  results 
as  beneficial  as  the  alternative  uses  to  which  the  money  might  otherwise 
be  put. 

However,  a  problem  arises  in  applying  the  time  value  concept  to 
evaluate  government  program  alternatives  in  that  the  opportunity  rate  of 
return  is  open  to  interpretation.  For  those  of  us  in  DOD,  however,  this 
has  already  been  decided.  DODI  7041.3,  18  October  1972,  directs  that  an 
annual  interest  rate  of  10%  shall  be  used  to  evaluate  program  proposals 
involving  time  periods  exceeding  one  year.  That  interest  rate  is  thought 
to  reflect  the  economic  character  of  alternative  uses  for  the  money  DOD 
proposes  to  use  for  its  purposes.  Thus,  a  planner  can  use  (7)  to  eval¬ 
uate  feasible  alternatives  that  will  satisfy  the  objectives  of  a  DOD 
program  over  its  anticipated  life  and  find  which  is  the  least  costly  in 
present  value  terms. 

Evaluating  alternatives,  however,  is  not  as  straightforward  a  matter 
as  (7)  might  suggest.  The  life  of  an  alternative  proposal  is  only 
infrequently  identical  to  that  of  the  program  it  is  designed  to  satisfy 
or  to  that  of  another  alternative  with  which  it  is  competing.  In  this 
respect  it  is  important  to  understand  that  competing  alternatives  must 
be  evaluated  over  the  same  time  interval.  Thus,  program  life  is  the 
correct  time  interval  for  evaluating  each  alternative  proposal.  This 
means  that  for  alternatives  with  lives  shorter  than  program  life,  the 
present  value  of  replacing  an  original  investment  in  hardware  and  facili¬ 
ties  together  with  the  present  value  of  any  residual  items  at  the  end  of 
the  program  must  be  taken  into  consideration.  For  an  alternative  with 
a  life  greater  than  the  program  life,  the  present  value  of  residuals  at 
the  end  of  the  program  are  recognized  in  like  manner. 

Naturally,  every  planning  problem  involving  the  concept  of  present 
value  is  not  related  to  a  new  program.  Many  such  problems  are  related 
to  existing  programs  in  which  the  alternatives  are  to  replace  or  continue 
use  of  existing  facilities.  Conditions  under  which  a  defense  program  is 
implemented  may  very  well  change  during  its  life,  and  the  means  to  satisfy 
its  continuing  objectives  may  also  change.  Thus,  it  is  entirely  possible 
that  replacement  of  existing  facilities  and  hardware  may  be  economically 
feasible;  present  value  analysis  will  confirm  whether  or  not  replacement 
should  take  place. 

These  two  basic  problems  of  choosing  among  competing  alternatives  to 
satisfy  the  objectives  of  a  new  program,  and  choosing  between  an  in-place 
alternative  and  a  proposed  replacement  for  an  already  implemented  program 
are  illustrated  in  the  following  examples. 
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EXAMPLE  21-1 s 


A  U.  S.  Navy  shipyard  is  to  be  equipped  with  new  metal  working 
facilities  to  satisfy  increased  fleet  requirements  anticipated  over  the 
next  30  years.  Three  alternatives,  each  with  a  different  useful  life 
are  being  considered.  Their  investment  costs  and  annual  operating  and 
maintenance  costs  sure  shown  below.  None  of  the  alternatives  will  have  a 
salvage  value  at  the  end  of  its  life. 

Table  21-1 


1 

Alternatives 

2 

3 

Life 

5 

10 

15 

Required  Investment 

$1,000 

$2,700 

$3,500 

Annual  O&M  Costs 

600 

500 

200 

Present  value  of  program  life  cycle  costs 

at  10%  interest 

factor 

Investment  +  PVF  of  Annual  Costs  over  Program  Life 
Alternative  Is  $1,000  +  600  (SFVF5)  +  PVP5  11000  +  600  (SPVFg)] 

+  PVF1q  [1000  +  600  (SPVF,. )  3  +  .  .  .  +  PVF25  [1000  +  600 
(SPVFg]  «  1000  +  600  (3.791)  +  1.487  (3275)  =  $8135 

Alternative  2:  $2,700  +  500  (SPVF^)  +  (PVF10  +  PVF2())  [2700  +  500 

(SPVF1q)] 

2700  +  500  (6.145)  +  (.535) [2700  +  500  (6.145)3  =  $8865 

Alternative  3:  $3,500  +  200  (SPVF15>  4  (P^Fg)  13500  +  200  (SPVF  ) 3 

3500  +  200  (7.606  +  .239  [3500  +  200  (7.606)3  =  $6215 

We  observe  that  the  first  alternative,  if  chosen,  would  be  replaced  five 
times  during  the  30-year  program  life,  and  that  the  second  alternative 
would  be  replaced  twice,  while  the  third  alternative  would  be  replaced 
once.  In  each  case,  the  present  value  of  replacement  investments  has 
been  calculated.  We  have  assumed  zero  salvage  value  for  each  of  the 
alternatives,  but  if  there  were  salvage  value,  its  present  value  should 
be  calculated  appropriately  and  subtracted  from  the  present  value  of  all 
alternative  costs. 


21-4 


From  the  above  calculations,  we  can  see  that  Alternative  3  with  its 
higher  investment  cost  but  lower  Q&M  costs  has  the  lowest  present  value 
life  cycle  costs. 

EXAMPLE  21-2; 

The  transient  aircraft  repair  facility  at  a  certain  U.S.  Air  Force 
Base  has  become  inadequate  to  accommodate  recent  vintage  jet  aircraft. 
Because  of  the  inadequacy,  long  delays  are  commonplace  in  servicing  and 
repair  operations .  It  has  been  decided  to  improve  the  facility  either 
by  modification  or  replacement.  The  existing  facility  has  a  salvage 
value  of  approximately  $90,000.  Modification  investment  costs  are 
estimated  to  be  $100,000.  Annual  operating  costs  of  the  modified  facility 
are  estimated  to  be  $60,000.  Alternatively,  the  facility  could  be 
replaced  with  a  new  plant  costing  $300,000  but  for  which  annual  operating 
costs  would  be  only  $35,000.  The  rejuvenated  facility  would  have  an 
expected  life  of  20  years,  after  which  it  would  have  no  salvage  value. 

The  replacement  facility  is  estimated  to  have  a  salvage  value  of  $80,000 
at  the  end  of  20  years. 

Present  value  of  program  life  cycle  costs  at  10%  interest  factor 
Modified  Facility:  $190,000  +  60,000  (SPVF2Q) 

=  190,000  +  60,000  (8.5136)  *  $700,816 

Replacement  Facility:  $300,000  +  35,000  (SPVF^) 

-  80,000  (PVF20) 

*  300,000  +  35,000  (8.5136) 

-  80,000  (.1486)  =*  $586,088 

We  observe  that  modifying  the  existing  facility  would  be  the  more  expen¬ 
sive  alternative  even  though  it  would  require  a  lesser  investment  to 
make  it  the  operational  equal  of  the  replacement  alternative. 


21.2  MINIMIZING  VARIABLE  COSTS  TO  PRODUCE  A  GIVEN  LEVEL  OF  OUTPUT  IN  TWO 
ALTERNATIVE  PRODUCTION  FACILITIES 


In  producing  any  single  item  at  some  given  level  of  output,  the  man¬ 
ager  of  defense  production  facilities  seeks  to  do  so  at  minimum  total 
cost.  The  available  resources  with  which  to  produce  output  may  very  well 
include  two  or  more  fixed  facilities  each  capable  by  itself  of  meeting 
the  production  goal.  Should  the  manager  accordingly  choose  one  or  another 
facility  based  on  which  has  the  least  average  cost  to  produce  the  given 
level  of  output? 


Somewhat  surprisingly,  minimum  cost  to  produce  can  usually  be 
achieved  by  allocating  output  among  all  the  alternatives  simultaneously. 
Least  cost  production  requires  that  output  be  allocated  among  alternative 
facilities  so  that  their  marginal  costs  are  equal.  Marginal  cost,  MC, 
of  course,  is  the  rate  of  change  in  total  cost,  TC,  with  respect  to  the 
rate  of  change  in  output,  x.  Thus, 

MC  =  d(TC)T 

dx 

Accordingly,  the  invested  cost  of  an  alternative  facility  has  nothing 
whatsoever  to  do  with  the  problem.  It  is  there  and  already  paid  for; 
the  only  question  is  how  variable  resources  such  as  labor  and  material 
that  are  subject  to  control  earn  be  combined  with  the  fixed  facilities  to 
produce  output  in  the  least  costly  manner.  Thus,  a  manager  of  in-place 
facilities,  in  evaluating  their  use,  need  concern  himself  with  variable 
costs  only. 

The  concept  of  minimum  cost  output  can  be  illustrated  in  the  follow¬ 
ing  terms.  Two  production  facilities,  A  and  B,  each  have  the  capacity 
to  produce  a  given  level  (say  10)  of  output  of  a  single  item.  The  unique 
total  variable  cost  (TVC)  curve,  average  variable  cost  (AVC)  curve  and 
marginal  cost  (MC)  curve  of  each  is  displayed  in  Figure  21-1. 

It  cam  be  seen  that  average  cost  to  produce  in  A  falls  initially  to 
some  minimum  value  then  rises,  but  for  any  level  of  output  greater  than 
10  units  is  less  than  average  cost  in  B  for  the  same  output.  This  would 
seemingly  suggest  that  to  produce  any  output  greater  than  10  units  at 
minimum  cost,  facility  A  should  be  used. 

But  a  glance  at  the  marginal  cost  curves  of  the  two  facilities  tells 
us  that  the  marginal  cost  at  the  given  level  of  output  in  A  exceeds 
marginal  cost  for  lesser  levels  of  output  in  B.  If  some  of  a  total  num¬ 
ber  to  be  produced  were  reallocated  to  B  so  that  both  facilities  produce 
simultaneously,  would  total  cost  be  reduced?  Yes,  indeed.  And  so  long 
as  MC  in  B  were  less  than  MC  in  A,  total  cost  could  be  reduced  further 
by  successive  reallocation  to  B  from  A.  When  MC  in  B  were  equal  to  A, 
minimum  cost  allocation  would  be  achieved. 

The  reason  that  total  cost  is  reduced  when  MC  in  B  is  less  than  in  A 
is  that  by  a  reallocation  of  one  unit  from  A  to  B,  total  cost  is  reduced 
more  in  A  than  it  is  increased  in  B.  In  more  general  terms,  the  minimum 
cost  allocation  for  any  output  level,  Q,  would  be  as  shown  in  Figure  21-2 

where  MC,  ■  MC  . 

A  B 
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Figure  21-2 


Naturally,  the  total  cost  to  produce  the  required  level  of  output  is  the 
sum  of  the  total  cost  in  A  to  produce  x  units  and  the  total  cost  in  B  to 
produce  (Q-x)  units. 

The  cost  and  output  characteristics  of  production  facilities  can 
sometimes  be  fitted  to  algebraic  curves  as  total  variable  cost  functions 
Where  this  is  possible,  the  solution  for  minimum  cost  allocation  can  be 
determined  by  mathematical  techniques.  To  illustrate  the  procedure, 
let  us  assume  that  the  two  production  facilities  A  and  B  have  the  fol¬ 
lowing  total  variable  cost  functions 


TVCa  =  |x3  -  5 Ox2  +  4610x 


TVCb  -  3x3  +  180x2  +  2000x 

where  x  represents  the  unit  of  output. 


A  level  of  70  units  of  output  must  be  produced.  Each  facility, 
remember,  has  the  capacity  to  produce  the  70  units  by  itself.  Output 
is  to  be  allocated  between  them  so  as  to  minimize  cost.  In  order  to 
solve  for  minimum  cost  output,  we  must  first  combine  the  two  functions 
into  one  common  expression.  Thus, 

TVC  =  TVC.  +  TVC_ 

A  B 


But  in  doing  this,  we  must  remember  that  of  a  total  output  of  70, 

the  number  allocated  to  A  is  not  allocated  to  B.  Accordingly ,  we  take  x 

as  the  variable  of  output  in  A  and  substitute  for  the  variable  of  output 

in  B,  the  variable,  (70  -  x) .  The  total  variable  cost  function  now 

becomes 

TVC  =  |x3  -  5 Ox2  -  4610x  +  3  (.70-x)  3  +  180 (70-x) 2  +  2000 (70-x) 

We  know  that  to  minimize  cost 


Me 


d  (TVC) 
dx 


0  and 


d  (MC) 
dx 


>  0 


In  differentiating  TVC,  collecting  terms  and  setting  the  result 
equal  to  zero,  we  have 

MC  =  3x2  -  608x  +  26676  =  0 

Solving  this  quadratic  equation,  we  get 

x  =  64,  138 

Testing  the  second  derivative  at  the  two  solution  values  of  x,  we 

have 


d  CMC) 
dx 


for  64  >  0; 


d  (MC) 
dx 


for  138  <  0 


From  calculus  we  know  that  a  relative  minimum  value  of  the  dependent 
variable  (in  this  case,  TVC)  exists  if  the  second  derivative  is  positive 
when  evaluated  at  some  value  of  the  independent  variable  (in  this  case,  x 

Accordingly,  our  allocation  for  minimum  cost  output  must  be 

Facility  A:  “64 


Facility  B:  (70-x)  =  6 


ft^U2^9^0Ufl!^nfinVETOOLSFOR  THE  LOGISnCSNANAGEROJ^IR 
FORCE  INST  OF  TECH  HRIGHT-PATTERSON  flFB  OH  SCHOOL  OF 
SVSTEHS  AND  LOGISTICS  J  E  ENGEL  APR  80  AFIT-LS-32 

F/G  lS/'S 


7/7 

NL 


1 


UNCLASSIFIED 


The  question  arises  why  a  total  variable  cost  curve  which  should  be 
expected  always  to  rise  for  increasing  output  should,  in  this  example, 
have  an  unseemly  relative  minimum  value  and  a  relative  maxi mum  value. 
The  curve,  TVC  *  TVC  +  TVC  when  plotted  in  terms  of  the  variable  of 

A  o 

production  in  A  looks  like 


Figure  21-3 
TOTAL  VARIABLE  COST 

(in  terms  of  the  variable,  x,  in  facility  A) 


We  can  observe  that  it  falls  from  an  initially  high  value  to  a 
relative  minimum  at  approximately  64  units  of  output,  according  to  our 
solution,  as  more  and  more  relatively  high  MC  units  in  B  are  reallocated 
as  relatively  low  MC  units  in  A.  But  further  allocation  from  B  to  A  in 
excess  of  64  results  in  increasing  total  variable  cost  as  marginal  cost 
in  A  for  each  reallocated  unit  exceeds  the  marginal  cost  in  B.  Beyond 
70  units  of  output,  the  curve  has  no  relevance  to  our  problem,  since 
for  that  dictated  level,  any  allocation  greater  than  70  to  A  would 
require  a  fictitious  allocation  of  negative  units  to  B. 

Two  considerations  suggest  that  a  further  development  of  least  cost 
allocation  can  be  made.  First,  it  is  possible  to  make  a  plot  of  the 
allocation  between  two  production  facilities  against  required  levels 
of  total  output,  making  it  easy  enough  to  determine  allocation  for  what¬ 
ever  level  of  output  is  imposed  on  the  facilities.*  Alternatively,  a 
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computer  program  can  be  developed  for  the  allocation.  Seoond,  if  the 
number  of  alternative  facilities  exceeds  two,  minimus  cost  allocation 
still  requires  equality  of  marginal  costs}  however/  the  solution  process 
is  more  complicated  and  is  the  subject  for  a  later  treatment  of  this 
basic  idea. 

Often*  the  production  process  in  a  facility  does  not  lend  itself 
to  analytical  expression/  and  so  least  cost  allocation  is  a  trial  and 
error  proposition.  Thus,  in  making  an  initial  allocation  of  variable 
labor  and  materials  among  alternative  facilities,  the  production  manager 
may  rely  primarily  on  a  combination  of  intuition  and  experience. 

In  refining  the  initial  allocation/  he  should  seek  to  reduce 
total  cost  for  each  successive  allocation.  So  long  as  the  cost  to  pro¬ 
duce  an  additional  unit  in  one  facility  is  less  them  that  of  other 
facilities/  output  in  that  facility  should  be  increased  by  a  reduction 
in  output  in  others;  total  variable  cost  will  fall.  When  MC  is  the  same 
in  each  facility,  total  variable  cost  can  be  reduced  no  further. 
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CHAPTER  22 


NONPARAMETRIC  STATISTICS 


INTRODUCTION 


Frequently  it  is  desired  to  conduct  statistical  analyses  under  con¬ 
ditions  which  do  not  permit  the  assumption  of  a  parametric  technique  to  be 
made  or  the  requirement  of  such  a  technique  to  be  met.  In  most  instances, 
there  exists  a  corresponding  nonparametric  or  "distribution  free"  statistic 
which,  although  generally  less  powerful,  may  be  employed  since  the  assump¬ 
tions  are  less  restrictive.  In  this  chapter  a  representative  selection  of 
these  nonparametric  statistics,  chosen  to  include  those  which  are  the  most 
useful  to  the  logistician,  is  presented.  They  are  grouped  into  three  major 
classifications:  measures  and  tests  involving  nominal  (categorical)  level 
data,  measures  and  tests  involving  ordinal  level  data,  and  goodness  of  fit 
tests . 


MEASURES  AND  TESTS  INVOLVING  NOMINAL  (CATEGORICAL)  LEVEL  DATA 

A.  A  One-Sample  Test  of  Occurrence  Frequencies/Probabilities:  Chi 
Square  Row  Vector. 

General  description/purpose:  A  chi  square  row  vector  may  be  used  to 
test  whether  a  statistically  significant  difference  exists  between  an 
observed  number  of  subjects,  objects,  or  responses  falling  into  each  of 
a  set  of  categories  (cells)  and  an  expected  number  based  on  hypothesized 
frequencies/probabilities  of  occurrence  within  the  categories 

Assumptions/limitations : 

1.  The  null  hypothesis  (Hq)  completely  states  the  proportions  of 
objects  falling  into  the  categories  (cells)  of  the  presumed  population  so 
that  the  expected  frequencies  can  be  deduced. 

2.  Category  (cell)  size  limitations  (criteria): 

a.  If  the  degrees  of  freedom  (df)  equals  1,  the  expected  fre¬ 
quency  of  each  category  (cell)  (E^ i  i  ■  l,2,..^c)  should  be  >.5. 

If  df  >_2,  each  E^  should  be  >.  1  and  at  least  80%  of  the  Ejs 
should  be  >_  5 . 

b.  Expected  frequencies  may  be  increased  by  combining  adjacent 
categories  (cells) ,  but  this  is  desirable  only  if  such  combinations  can  be 
meaningfully  made. 
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c.  If  either 


(1)  One  starts  with  only  two  categories  (cells)  and  has  one 
(or  both)  E^  <  5, 

(2)  or,  if  after  combining  adjacent  categories  (cells) ,  one 
ends  up  with  only  two  categories  (cells)  and  still  has  one  (or  both)  <  5 
then  the  binomial  test  (see  "Other  test(s)")  should  be  used  rather  than 
the  chi  square. 

Discussion/methodology s  The  procedure  for  application  of  the  chi  square 
row  vector  teat  involves  these  steps: 

1.  Cast  the  observed  frequency  of  each  category  (cell)  (0^;  1  *  1,2, 

.  .^c)  into  its  corresponding  component  of  a  c— dimensional  row  vector.  The 
sum  of  the  frequencies  should  be  N,  the  number  of  independent  observations. 


2.  Determine  the  degrees  of  freedom:  df  ■  c  -  1. 

3.  From  Ho,  determine  the  expected  frequency  of  each  category  (cell): 

1  hypothesized  frequency  of  category  (cell)  i, 

I  as  stated  in  Hq 


E  « 


<  or 

(hypothesized  probability  of  category  (cell)  i, 
as  stated  in  Hq)  •  N 


i  *  1,  2,  . . . , c 


When  df  >  2,  if  more  than  20%  of  the  Ej.'s  are  <  5,  combine  adjacent  cate-* 
gories  (cells) ,  where  this  is  reasonable ,  until  the  category  (cell)  size 
limitations  (criteria),  are  met.  (Note:  The  value  of  c  will  be  reduced, 
while  the  values  of  some  of  the  Ej/s  will  be  increased.)  When  df  ■  1, 
use  this  test  (approximately)  only  if  both  Ejs  are  >.  5. 

The  data  may  be  conveniently  displayed  as  follows: 


Categories  Total 


c 

where  H  ■  I  0, 
i-1 


2  2 

4.  Compute  the  value  of  the  test  statistic,  x  observed  or  xQ: 


o2 

MB 

E3 

■ 

N 

■ 

E2 

HI 

■ 

H 

if  df  -  1  (c-2) 


f  «  <|0j  —  Ell-1.)2 
M  Ei 


or 


c  (0i- 
i-1  Ei 


if  df  >  2  (e  >  3) 


S.  Determine  the  critical  value  for  the  test  statistic,  X  critical 


or 


fxJ,df  -  xl  ,  c-1 


2a, df  2a, c-1 


if  a  two-tailed  test  at  the 
a-level  of  significance 


if  a  one-tailed  test  at  the 
a-level  of  significance 


6. 


Resolve  the  hypothesis  test: 


if  X*  >  Xq,  reject  Hq. 


EXAMPLE  22-1 


The  AF-wide  mix  of  enlisted  personnel  in  a  particular  logistics  career 
field  is  83%  male  and  17%  female.  At  Boon  Docks  AFB,  79  enlisted  men  and 
21  enlisted  women  in  this  career  field  are  assigned.  Is  the  mix  of 
enlisted  personnel  at  this  base  typical  of  the  career  field?  Test  at 
a  «  0.10. 


HQ:  The  male  (M) /female  (F)  mix  of  enlisted  personnel  at  Boon 
Docks  AFB  is  typical  of  the  particular  logistics  career 
field,  i.e.,  P(M)==  0.83  and  P(F)s=0.17. 


Hj_:  It  is  atypical,  i.e.,  P(M)  /  0.83  and  P(F)  /  0.17 


Male  (M)  Female  (F)  N 


79J 

83 


17 


100 


df  »  c-1  ■  2-1  •  1 


E^j  -  P(Ml*H  -  (0.83)  a00)  -  83 
By  -  P(F) -  (0.17X100)  -  17 
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E  and  E  both  >.  5?  (yes) 


C"2  (jOi  _  Ei  |  -V* 

1 


i-1 


E. 


since  df  *  1 


_  (179-831  -ft)2  ( 1 21-17 1 -h) 2 

83  17 


2  0.87 


*e  ’  *ldf  -  *\l0,  1  -  2-n 


X2  *  0.87  <  2.71  -  x? 
o  c 


Conclusion:  Fail  to  reject  Hq.  There  is  insufficient  evidence  to  conclude 
that  the  male/female  mix  of  enlisted  personnel  at  this  base  is  atypical  of 
the  particular  logistics  career  field  at  the  a  -  0.10  level  of  significance. 

Reference (s) .  Siegel,  Sidney.  Nonpar  ame trie  Statistics  for  the  Behavioral 
Sciences.  (McGraw-Hill:  New  York,  1956)  pp.  42-47. 

Other  test(s).  Binomial  test,  see  Siegel,  pp.  36-42. 

B.  The  TWo  Related  Samples  Case:  The  McNemar  Test  for  Difference  or  Changes 


General  description/purpose:  The  McNemar  test  may  be  used  to  test 
whether  a  statistically  significant  difference  exists  between  two  treat¬ 
ments,  or  whether  one  treatment  is  "better”  than  another,  when  each  subject 
"serves  as  his  own  control"  by  being  exposed  to  both  treatments  at  dif¬ 
ferent  times.  It  is  particularly  applicable  to  "before  and  after"  treat¬ 
ment  designs  in  which  measurement  is  in  the  strength,  of  either  a  nominal 
or  an  ordinal  scale  to  assess  the  "before  to  after"  change,  in  direction 
only,  due  to  a  single  treatment. 

Assumptions/limitations : 

1.  The  usual  parametric  technique  for  analyzing  data  from  two  related 
samples  is  the  application  of  the  T-test  to  the  difference  scores.  There 
are,  however,  circumstances  under  which  the  paired-samples  t-test  is 
inapplicable : 

a.  The  assumptions/requirements  of  the  paired-samples  t-test  are 
considered  unrealistic  or  are  found  not  to  hold  for  the  data. 


22-4 


b.  It  is  preferable  to  avoid  making  the  assumptions  or  testing 
the  requirements  of  the  paired-samples  t-test  and,  thus ,  give  greater 
generality  to  the  conclusions  drawn  (at  the  expense  of  employing  a  less 
powerful  test) . 

c.  Hie  differences  between  matched  pairs  sure  not  represented  as 
scores,  but  rather  as  "signs",  i.e.,  it  is  indicated  which  member  of  any 
pair  is  "greater  than"  the  other,  but  the  magnitude  of  the  difference  is 
not  known. 

d.  Hie  score  data  are  simply  classificatory,  i.e.,  the  two  members 
of  any  matched  pair  can  either  respond  in  the  same  way  or  in  entirely 
different  ways  which  do  not  stand  in  any  order  or  quantitative  relation. 

In  these  instances,  the  McNemar  test  may  be  applied 

2.  If  the  expected  frequency  of  changes  pj(A  +  D)  ]  is  <  5,  the  binomial 
test  (see  reference  under  "Other  test(s)")  should  be  used  rather  than  the 
McNemar  test. 

Discussion/methodology : 

Hie  procedure  for  application  of  the  McNemar  test  involves  these  steps: 
1.  Cast  the  observed  frequencies  into  a  fourfold  table  of  the  form: 


After 


Before 


where  A  is  the  number  of  subjects  who  changed  their  responses  from  -  to  + 
as  a  result  of  the  application  of  the  treatment,  etc. 

2.  Determine  the  expected  frequency  of  change:  E  *  *»(A  +  D) .  If 
E  <  5  use  the  binomial  test  rather  than  the  McNemar  test. 

2  2 

3.  Compute  the  value  of  the  test  statistic  X  observed  or  Xq: 

2  _  (1a-d1-1)2 

*o  A  +  D 


4.  Determine  the  critical  value  for  the  test  statistic,  X  critical 
2 

or  X0‘ 
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or 


if  a  two-tailed  test  at  the 
a-level  of  significance 


t 


if  a  one-tailed  test  at  the 
a-level  of  significance 


2  2 

5.  Resolve  the  hypothesis  test;  if  X  >  X  ,  reject  HQ. 

O  C  ” 

EXAMPLE  22-2 

Currently,  single  enlisted  personnel  in  overseas  areas  residing  in 
dormitories  and  subsisting  in  government  dining  facilities  are  not  authorized 
a  cost  of  living  allowance  (COLA) ,  which  is  paid  to  other  co-assigned  mili¬ 
tary  personnel.  A  proposal  to  fund  singles'  COLA  was  first  submitted  to 
the  Congress  in  DOD's  FY79  supplemental  budget  request.  To  explain  the 
rationale  for  requesting  singles-'  COLA,  HQ  USAF/DCS  Manpower  and  Personnel 
produced  a  Palace  Flick  on  the  subject.  As  part  of  a  test  of  the  potential 
of  this  Palace  Flick  to  influence  the  attitudes  of  targeted  groups  of  key 
military  personnel  concerning  singles'  COLA,  a  sample  of  25  junior  majors 
without  overseas  unit  command  experience  and  assigned  to  the  Pentagon  was 
selected.  Each  subject  was  questioned  about  his  attitude  concerning 
singles'  COLA,  shown  the  Palace  Flick,  and  then  questioned  again  about  his 
attitude.  The  attitude  data  from  the  sample  were  as  follows: 


Attitude 
concerning 
singles '  COLA 
before  being 
shown  the 
Palace  Flick 


Mo  opinion  or 
did  not  favor 

Favored 


Attitude  concerning  singles' 
COLA  after  being  shown 
the  Palace  Flick 


Favored 


MO  opinion  or 
did  not  favor 


14 

3 

6 

2 

Is  the  Palace  Flick  effective  in  favorably  (positively)  influencing 
(changing)  the  attitudes  of  junior  majors  without  overseas  unit  command 
experience  concerning  singles'  COLA?  Test  at  a  -  0.01. 

Hq:  For  those  junior  majors  without  overseas  unit  command  experi¬ 
ence  whose  attitudes  concerning  singles'  COLA  are  influenced 
by  being  shown  the  Palace  Flick,  the  probability  that  any 
major  will  change  his  attitude  from  "Mo  opinion  or  does  not 
favor"  to  "Favor"  (PA)  is  equal  to  the  probability  that  he 
will  change  his  attitude  from  "Favors"  to  "MO  opinion  or  does 
not  favor"  (PD)  is  equal  to  >j.  i.e.  P.  ■  PD  ■  h, 


E  -  *j (A  +  D)  -  *s(14  +  2)  ■  8  E  >_  5?  Yes 

2  „  (1a-dL-1)  2 

*0  A  +  D 

.  ( 114-2 l-l)2 

14  +  2 

*  7.56 


Since  this  is  a  one-tailed  test  at  a  ■  0.01  with  rejection  region  determined 
by  :  pa  >  PD,  it  corresponds  to  a  two-tailed  test  at  2a  *  0.02  with 
rejection  region  determined  by  H^:  Pft  /  PD. 


lo,02,  1 


5.41 


X?  *  7.56  >  5.41  -  x2 

o  c 

Conclusion:  Reject  H0.  Conclude  that  the  Palace  Flick  is  effective  in 
favorably  (positively)  influencing  (changing)  the  attitudes  of  junior 
majors  without  overseas  unit  command  experience  concerning  singles'  COLA 
at  the  a  -  0.01  level  of  significance. 


Reference(s) .  Siegel,  pp.  63-67. 

Other  test(s).  Binomial  test;  see  Siegel,  pp.  36-42. 


C.  The  Two  Independent  Samples  Case.  A  test  of  Occurence  Frequencies/ 

Probabilities  or  for  Independence:  Chi  Square  Contingency  Table. 

General  description/purpose:  A  chi  square  contingency  table  may  be  used 
to  test  whether  a  statistically  significant  difference  exists  between  two 
independent  groups  with  respect  to  some  characteristic;  specifically,  be¬ 
tween  observed  numbers  of  subjects,  objects,  or  responses  falling  into  each 
of  a  set  of  categories  (cells)  from  each  of  two  groups  and  the  expected 
frequencies  of  occurrence  within  the  categories  based  on  the  hypothesis  of 
independence  between  the  groups  with  respect  to  the  characteristic. 

Assumptions/limitations : 

1.  If  the  observed  frequencies  of  occurrence  consist! tute  a  2  x  2  con¬ 
tingency  table  (two  categories  (cells)  for  each  of  the  two  groups  being 
contrasted) ,  then  the  possible  application  of  the  chi  square  contingency 
table  test  should  be  determined  on  the  basis  of  the  following  criteria: 
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a.  When  the  number  of  observations  N  >  40,  use  this  test  with 
the  test  statistic  corrected  for  continuity  (see  step  4  in  "Discussion/ 
methodology"  below). 

b.  When  20  <_  N  <_  40,  use  this  test  with  continuity  correction 
only  if  the  expected  frequency  of  each  category  (cell)  (Eij;  i  ■  1,  2,  . .., 
j  -  1,  2,  ...»  e)  is  >_  5.  If  the  smallest  expected  frequency  is  <  5,  use 
the  Fisher  exact  probability  test  (see  reference  under  "Other  test(s)”). 

c.  When  N  <  20 ,  always  use  the  Fisher  test. 

2.  If  the  degrees  of  freedom  (df)  is  >  2,  use  this  test  only  if  each 
Eij  1  and  at  least  80%  of  the  Eij's  are  >.  5  (category  (cell)  size 
limitations  (criteria) . 

Expected  frequencies  may  be  increased  by  combining  adjacent  cate¬ 
gories  (cells) ,  but  this  is  desirable  only  if  such  combinations  can  be 
meaningfully  made. 

3.  When  df  >  2,  this  test  is  insensitive  to  the  effects  of  order.  If, 
then,  hypotheses  take  order  into  account,  this  test  may  not  be  the  best. 

For  methods  that  strengthen  the  chi  square  tests  when  Ho  is  tested  against 
specific  (including  ordered)  alternatives,  consult  Cochran  (1954)  (see 
"References") . 

Discussion/methodology i 

The  procedure  for  application  of  the  chi  square  contingency  table  test 
involves  these  steps: 

1.  Cast  the  observed  frequency  of  each  category  (cell)  (O^j ,  i  ■  1,  2, 

. ..,  r;  j  -  1,  2,  ...,  c)  into  its  corresponding  position  in  an  (r  x  c)  -  di 
mensional  array.  The  assignments  of  the  groups  and  the  values  of  the 
characteristics  to  either  the  rows  or  the  columns  of  the  array  is  immaterial 
The  sum  of  the  frequencies  should  be  N,  the  number  of  independent  observa¬ 
tions. 

2.  Determine  the  degrees  of  freedom;  df  *  (r-1) (c-1) . 

3.  Determine  the  expected  frequency  of  each  category  (cell)  as  the 
product  of  its  marginal  totals  divided  by  N: 


marginal  total  of  row  i)* (marginal  total  of  column 


N 

j  "1,  2,  ••«,  c 


When  df  ^  2,  if  more  than  20%  of  the  Eij’s  are  <5,  combine  adjacent  cate¬ 
gories  (cells),  where  this  is  reasonable,  until  the  category  (cell)  size 
limitations  (criteria)  are  met.  (Note:  The  values  of  r  and/or  c  will  be 
reduced,  while  the  values  of  some  of  the  En'e  will  be  increased.) 


The  data  may  be  conveniently  displayed  as  follows: 
Groups  or  Values  of  the  Characteristic 


0,_ 


Marginal 

Totals 


°11 

E11 

°12 

E 

12 

°rl  °r2 


0 

rc 

E 

rc 

c 

£  0  . 
i-1  r3 

S  °ic 

i=l  1C 

N 

Where  E. 


and  N  »  £  £  oH 

i=l  j=l 


2  2 

4.  Compute  the  value  of  the  test  statistics,  X  observed  or  X^: 


l  t  (IOjj-Eijl-%)2 

i-1  j-1  Ei  j 

or 


if  df  «  1  (r  »  2  and  c 


r  c  r  c  to 

i-i  j-i  Eij  |^i-i  j-i  l  Ei 


-  H  if  df  >  2 


5.  Determine  the  critical  value  for  the  test  statistic,  X  critical 


or  X?: 


Xa,df  "  Xa,  <r-l)  tc-U 


if  a  two-tailed  test  at 
the  a-level  of  significance 


x2 

l  *2a. 


df  *2a,  (r-lH.c-1) 


if  a  one-tailed  test  at 
the  a-level  of  significance 


2  2 

Resolve  the  hypothesis  test:  if  X  >  X  ,  reject  HQ. 

o  c 


EXAMPLE  22-3 

To  increase  safety-consciousness  and  reduce  the  incidence  of  work- 
related  accidents  throughout  the  command,  AFLC  is  considering  whether  all 
personnel  should  be  tested  annually  on  safety  procedures  and  regulations . 
In  preparation  for  this  testing,  it  has  been  proposed  that  all  personnel 
annually  attend  a  safety  refresher  mini-course.  To  investigate  the  effect 
of  attendance  at  such  a  mini-course  on  personnel  performance  on  a  safety 
test,  the  headquarters  staff  developed  three  (3)  safety  refresher  mini¬ 
courses  with  two  (2) ,  four  (4)  ,  and  eight  (.8)  hours  of  instruction  and 
drafted  a  test  on  safety  procedures  and  regulations.  At  an  unidentified 
ALC,  200  randomly  selected  persons  attended  either  one  or  none  of  the  mini 
courses  and  then  were  administered  the  safety  test.  The  test  results  were 
as  follows: 

Number  of  hours  of  instruction  attended 
in  safety  refresher  mini-course 


No  instruction  2  hours  4  hours 


8  hours 


Performance 
on  safety 


9 

50 

38 

18 

21 

50 

12 

2 

On  the  basis  of  the  outcomes  of  this  experiment,  should  AFLC  personnel's 
performance  on  a  test  on  safety  procedures  and  regulations  be  expected  to 
be  affected  by  the  number  of  hours  of  instruction  attended  in  a  safety 
refresher  mini-course?  Test  at  a  ■  0.05. 

Hq;  AFLC  personnel's  performance  on  a  test  on  safety  procedures 

and  regulations  is  independent  of  the  number  of  hours  of  instruc 
tion  attended  in  a  safety  refresher  mini -course. 

Hi:  It  is  dependent  thereon. 


Number  of  hours  of  instruction  attended  in  safety 
refresher  mini-course 


Performance 
on  Safety 


I  Marginal 
Total* 


17.25 


12.75 


57.50  28.75  11.50 


42.50  21.25 


MARGINAL 

TOTALS 


\wB 


r  *  2,  c  *  4  df»  (r-l)Cc-l)  ■  (2-1) (4-1)  -  3 


(Marginal  total  of  row  1) * (Marginal  total  of  column  1) 

N 


(11S)  (30) 


■  17.25,  etc. 


Since  df  >_  2,  is  each  >_  1  and  are  at  least  80%  of  the  Ej^s  >_  5?  Yes 
°  i-1  j-1  Eil 


(9-17.5) 


(1,1)  cell 


(50-57.5)' 


(1,2)  cell 


'  _  T  - 

r»2  c=»4  o  •  a  2 

z  z  zr^~  -  n 

i-i  j-i  Eij 

J 


(2-8.5) 

8.5 

(2,4)  cell 


(1,1)  cell  (1,2)  cell 


*  27.23 


+  ....  + 


(2,4)  cell 


2  2  2 

X  -  X  «  X*  _  -  7.81 

c  a,df  0.05,3 


X?  *  27.23  >  7.81  -  x2 


Conclusion:  Reject  H0.  Strongly  conclude  that  AFLC  personnel's  performance 
on  a  test  on  safety  procedures  and  regulations  should  be  expected  to  be 
affected  by  (dependent  on)  the  number  of  hours  of  instruction  attended  in 
a  safety  refresher  mini-course  at  the  a  •  0.05  level  of  significance. 

Reference (s) .  Siegel,  pp.  104-111 

Other  testCs).  Fisher  exact  probability  test;  see  Siegel,  pp.  96-104 


Measures  of  Correlation:  Pearson's  Contingency  Coefficient 

General  description/purpose:  Pearson's  contingency  coefficient  C  is  a 
measure  of  the  extent  of  association  or  relation  between  two  independent 
groups  with  respect  to  some  characteristic,  i.e.,  between  two  sets  of 
attributes.  It  is  uniquely  useful  when  there  is  only  nominal  (categorical) 
level  information  available  concerning  one  or  both  groups,  such  as  when 
that  information  consists  of  unordered  series  of  occurrence  frequencies, 
the  use  of  this  contingency  coefficient,  in  contrast  to  Pearson's  para¬ 
metric  product  moment  (sample  correlation  coefficient  r) ,  does  not  require 
either  Cl)  the  assumptions  of  underlying  continuity  or  a  specific  popula¬ 
tion  distribution  for  the  categories  which  comprise  either  or  both  sets  of 
attributes,  or  (2)  that  the  categories  be  o referable  in  any  particular  way. 
The  latter  insures  that  this  contingency  coefficient,  as  computed  from  a 
contingency  table,  will  have  the  same  value,  regardless  of  the  arrangements 
of  the  categories  (cells)  in  the  rows  and  columns  of  the  table.  The 
coefficient's  value  always  lies  between  0  and  1,  and  equals  0  in  the  case 
of  complete  independence  between  the  groups  with  respect  to  the  charac¬ 
teristic,  i.e.,  when  ■  0. 

Assumptions/limitations : 

Pearson's  C  suffers  from  four  limitations: 

1.  The  data  must  be  amenable  to  computation  of  the  chi  square  test 

statistic  (X2  observed  or  X2)  before  C  may  appropriately  be  used. 

o 

2.  Even  though  C  ■  0  in  the  case  of  complete  independence,  i.e.,  when 
X2  ■  0,  C  cannot  attain  its  upper  limit  of  1;  in  the  case  of  complete  asso¬ 
ciation,  the  value  of  C  depends  on  the  numbers  of  rows  and  columns  in  the 
contingency  table. 

3.  Two  nonzero  Pearson's  contingency  coefficients  are  not  comparable 
unless  they  are  yielded  by  contingency  tables  of  the  same  size. 

4.  C  is  not  directly  comparable  to  any  other  measure  of  correlation, 
such  as  Pearson's  r,  Spearman's  r_,  or  Kendall's  t. 


Discussion/methodology : 


1.  Despite  the  above  limitations ,  C  is  an  extremely  useful  measure  of 
association  because  of  wide  applicability  resulting  from  its  freedom  from 
assumptions  and  other  restrictive  requirements.  C  may  be  used  to  indicate 
the  degree  of  relation  between  two  sets  of  data  when  other,  especially 
parametric,  measures  of  association  are  inapplicable . 


2.  Pearson's  contingency  coefficient  C  is  defined  as  follows: 


C  - 


y  observed 
N  +  X2  observed 


EXAMPLE  22-4 


The  previous  example  of  a  chi  square  contingency  table  is  continued. 


C  « 


y2  observed 
N  +  X2  observed 


V 


27.23 

200  +  27.23 


«  0.3462 


Hie  correlation  expressed  by  Pearson 1 s  contingency  coefficient,  between  AFLC 
personnel's  performance  on  a  test  on  s'afety  procedures  and  regulations  and 
the  number  of  hours  of  instruction  attended  in  a  safety  refresher  mini¬ 
course  is  C  a  0.35. 

Reference Cs) . 

1.  Everitt,  B.  S.  Hie  Analysis  of  Contingency  Tables.  (Halstead  Press 
New  York,  1977}  pp.  56-57. 

2.  Siegel,  pp.  196-202. 


Cramer's  Contingency  Coefficient 

General  description/purpose :  Like  Pearson' •  contingency  coefficient  C, 
Cramer's  C  is  also  a  measure  of  the  extent  of  association  or  relation 
between  two  independent  groups  with  respect  to  some  characteristic,  i.e., 
between  two  sets  of  attributes.  Generally,  this  contingency  coefficient 
may  be  similarly  described  as  Pearson's  C.  In  particular,  its  value  also 
lies  between  0  and  1,  and  equals  0  in  the  case  of  complete  independence 
between  the  groups  with  respect  to  the  characteristic,  i.e.,  when  X^  ■  0. 


As  sump tions/1 imitations : 


Cramer's  C  exhibits  three  properties  similar  to  the  limitations  on 
Pearson's  C: 

1.  The  data  must  be  amenable  to  computation  of  the  chi  square  test 
statistic  (X2  observed  or  X2)  before  Cramer's  C  may  appropriately  be  used. 

2.  Cramer's  C,  however,  in  contrast  to  Pearson's  C,  may  attain  its 
upper  limit  of  1  in  the  cue  of  complete  association  for  all  numbers  of 
rows  and  columns  in  the  contingency  table. 

3.  Cramer's  C  is  also  not  directly  comparable  to  any  other  measure  of 
correlation,  such  as  Pearson's  r,  Spearman's  r8,  Kendalls  t,  or  even 
Pearson's  C. 


Discussion/methodology : 

Cramer ' s  contingency  coefficient  is  defined  as  follows: 


Cramer's  C 


2 

X  observed/M 
min  (r-1,  c-1) 


EXAMPLE  22-5 


The  previous  example 

Cramer's  C  * 


of  a  chi  square  contingency  table  is  continued. 

X2  observed/N 
min  (r-1,  c-1) 


m  27.23/200 
min  (2-1,  4-1) 

*  0.1362 

The  correlation,  expressed  by  Cramer's  contingency  coefficient,  between 
AFLC  personnel's  performance  on  a  test  on  safety  procedures  and  regulations 
and  the  number  of  hours  of  instruction  attended  in  a  safety  refresher  mini- 
course  is  a  0.14.  (Mote  that  this  correlation,  expressed  by  Pearson's  con¬ 
tingency  coefficient,  is  C  a  o.35j  but,  these  values  are  not  directly 
comparable.) 

Reference (s) :  Everitt,  pp.  56-57. 


MEASURES  AND  TESTS  INVOLVING  ORDINAL  LEVEL  DATA 


A.  Related  Samples  Tests  For  Differences  or  Changes : 

A  two-sample  Test:  Wllcoxon  Matched-Pairs  Signed-Ranks  Test 

General  description/purpose:  The  Wllcoxon  matched  pairs  signed-ranks 
test  may  be  used  to  test  whether  a  statistically  significant  mean  difference 
exists  between  the  response  levels  of  two  treatments ,  or  whether  one  treat¬ 
ment  is  "better"  than  another,  when  each  subject  "serves  as  his  own  con¬ 
trol”  by  being  exposed  to  both  treatments  at  different  times.  It  is  particu¬ 
larly  applicable  to  "before  and  after"  treatment  designs  in  which  measure¬ 
ment  is  in  the  strength  of  either  an  interval  or  a  ratio  scale  to  assess 
the  "before  to  after”  change,  in  both  (relative)  magnitude  and  direction, 
due  to  a  single  treatment. 

Assumptions/limitations 

1.  The  usual  parametric  technique  for  analyzing  data  from  two  related 
samples  for  differences  or  changes  is  the  application  of  the  t-test  to  the 
difference  scores.  There  are,  however,  circumstances  under  which  the 
paired-samples  t-test  is  inapplicable: 

a.  The  assumptions/requirements  of  the  paired-samples  t-test  are 
considered  unrealistic  or  jure  found  not  to  hold  for  the  data. 

b.  It  is  preferable  to  avoid  making  the  assumptions  or  testing  the 
requirements  of  the  paired-samples  t-test  and,  thus,  give  greater  generality 
to  the  conclusions  drawn  (at  the  expense  of  employing  a  less  powerful  test) . 

2.  For  n  matched  pairs  of  observations,  (Xi,  Yj_) : 

a.  The  measurement  scale  for  the  Xs  and  the  Ym  is  either  inter¬ 
val  or  ratio. 

b.  The  pairs,  (Xi,  Yj_) ,  i  ■  1,  ...,  n  constitute  a  random  sample 
from  a  bivariate  population  distribution. 

3.  For  the  difference,  D^,  between  Xi  and  Yi  for  the  ith  pair  of 
observations : 

a.  Each  Di  is  a  continuous  random  variable. 

b.  The  distribution  of  each  Di  is  symmetric. 

Discussion/methodology : 

The  procedure  for  application  of  the  Wilcoxon  matched-pairs  signed- 
ranks  test  involves  these  steps: 

1.  Determine  the  differences,  Di  ■  Xj.  -  Yi,  i  m  1,  ...,  n. 
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2.  If  any  D^*  0,  drop  it  from  the  set  and  decrease  n  by  one  (1) . 

3.  Rank  order  the  absolute  values  of  the  differences,  |DjJ .  if  ties 
occur,  average  the  ranks  of  the  absolute  differences  involved  in  the  tie 
and  assign  the  average  as  the  rank  of  each  tied  value. 

4.  Suffix  to  each  rank  the  algebraic  sign  of  the  difference,  D^, 
corresponding  to  it. 

5.  Determine  the  total  of  the  positive  signed-ranks,  R+,  and  the  total 
of  the  negative  signed-ranks,  R-. 

i 

6.  Determine  the  value  of  the  test  statistic,  R:  R  ■  smaller  of 
R+,  R-. 


7.  Determine  from  a  table  the  critical  value  for  the  test  statistic. 


or  ffc,  and  the  appropriate  algebraic  sign  of  the  smaller  sum  of 


W  (Lower  tail  critical  value) 

**  if  a  two-tailed  test  at  the 
a-level  of  significance 
(AND  R  -  either  R-  or  R+) 

w  (Lower  tail  critical  value) 

if  a  one-tailed  test  at  the 
a-level  of  significance 
AND  R  -  Rf  if  a  one-tailed  test 
to  the  left,  R  ■  R-  if  a  one- 
tailed  test  to  the  right 


8.  Resolve  the  hypothesis  test:  if  R  <  Wc,  and  the  smaller  sum  of  ranks 
is  of  the  appropriate  algebraic  sign  for  a  one-tailed  test,  reject  H0. 


EXAMPLE  22-6 


Ten  USAF  enlistees  were  randomly  selected  while  undergoing  Basic 
Military  Training  (BMT)  to  determine  whether  a  recently-devised  speed¬ 
reading  program  can  improve  their  reading  rates.  The  enlistees  were  given 
standardized  reading  exams  before  and  after  the  program.  The  exam  scores 
in  words  per  minute  were  as  follows: 


Enlistee 


Before 


After 


Is  the  recently-devised  speed-reading  program  effective  in  improving 
(increasing)  the  reading  rates  of  enlistees  during  BMT?  Test  at  a  -  0.05. 

Hq  «  y  before  y  after 


Hj_  —  y  before  <  y  after 


■Hie  differences,  Dj.,  the  ranks  of  |DjJ ,  and  the  algebraic  signs  of  these 
ranks  sure: 


Enlistee  D-i 


Sign 


1 

2 

3 

4 
5» 
6 

7 

8 
9 

10 


-25 

-225 

-350 

+20 

+20 

0 

-90 

-5 

-30 

-100 


tie 


4 
8 
9 

2.5 

(eliminate) 

6 

1 

5 
7 


+ 

+ 


Since  D6  -  0,  eliminate  it.  Now  n  -  10  -  1  *  9 
R+  -  2.5  +  2.5  -  5 
R-  -  4  +  8  +  9  +  6  +  1  +  5  +  7-40 


Conclusion j  Reject  Hq.  Conclude  that  the  recently-devised  speed-reading 
program  is  effective  in  improving  (increasing)  the  reading  rates  of  enlistees 
during  BMT  at  the  a  -  0.05  level  of  significance. 

Reference (s) .  Siegel,  pp.  75-83. 

Other  test(s).  For  large  samples,  especially  for  n  beyond  the  range  of 
tabulated  critical  values,  there  is  a  normal  approximation;  see  Siegel, 
pp.  79-83. 
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Table  22-1 


Critical  Values  of  the  Wilcoxon  Matched-Pairs  Signed-Ranks  Test  Statistic 


Sample  size 


n 

w 

.005 

W 

.01 

U 

.025 

W 

.05 

w.io 

W 

.20 

n(n+l)/2 

4 

0 

0 

0 

0 

1 

3 

10 

S 

0 

0 

0 

1 

3 

4 

15 

6 

0 

0 

1 

3 

4 

6 

21 

7 

0 

1 

3 

4 

6 

9 

28 

8 

1 

2 

4 

6 

9 

12 

36 

9 

2 

4 

6 

9 

11 

15 

45 

10 

4 

6 

9 

11 

15 

19 

55 

11 

6 

8 

11 

14 

18 

23 

66 

12 

8 

10 

14 

18 

22 

28 

78 

13 

10 

13 

18 

22 

27 

33 

91 

14 

13 

16 

22 

26 

32 

39 

105 

15 

16 

20 

26 

31 

37 

45 

120 

16 

20 

24 

30 

36 

43 

51 

136 

17 

24 

28 

35 

42 

49 

58 

153 

18 

28 

33 

41 

48 

56 

66 

171 

19 

33 

38 

47 

54 

63 

74 

190 

20 

38 

44 

53 

61 

70 

82 

210 

Table 

entries  are 

lower  tail  critical 

values 

Source 

:  Adapted  from  R.  L. 

.  McCormack, 

"Extended  Tables 

of  the  Wilcoxon 

Hatched  Pairs  Signed  Rank  Statistics,” 

Journal 

of  the 

American 

Statistical 

Association,  60  (1965Jj,  pp. 

864-71. 

A  k-Sample  Test: 

Friedman 

2-way  ANOVA  by  Ranks 

Test 

General  Description/purpose:  The  Friedman  2-way  analysis  of  variance 
(ANOVA)  by  ranks  test  may  be  used  to  test  whether  statistically  significant 
differences  exist  among  k  populations  from  which  matched  samples,  each 
having  the  same  number  of  cases,  have  been  drawn.  The  matching  may  be 
achieved  by  studying  the  same  group  of  n  subjects  under  each  of  k  conditions, 
or  n  matched  groups  of  k  subjects  may  be  obtained,  and  then  one  subject  in 
each  group  randomly  assigned  to  the  first  condition,  one  subject  in  each 
group  to  the  first  condition,  one  subject  in  each  group  to  the  second  con¬ 
dition,  etc.,  through  the  kth  condition.  Observations  are  cast  into  a 
2-way  table  having  n  rows,  representing  the  various  individual  subjects 
or  matched  groups  of  subjects,  and  k  columns,  representing  the  various 
conditions.  If  the  scores  of  subjects  serving  under  all  conditions  are  under 
study,  then  each  row  of  the  table  presents  the  scores  of  one  individual 
subject  or  one  matched  group  of  subjects  under  each  of  the  k  conditions. 

The  data  for  this  test  are  ranks.  The  scores  for  each  subject,  i.e.,  in 
each  row,  are  ranked  separately.  So,  when  k  conditions  are  being  studied, 
the  ranks  for  each  subject  range  from  1  to  k.  The  Friedman  test  determines 


whether  it  is  likely  that  the  k  columns  of  ranks  (k  matched  samples)  were 
drawn  from  the  same  population. 

Assumptions/limitations : 

1.  The  usual  parametric  technique  for  analyzing  data  from  k  related 
samples  is  the  application  of  the  2 -way  ANOVA  with  k  treatments  (conditions) 
and  n  cases  (subjects).  There  are ,  however,  circumstances  under  which  the 
2-way  ANOVA  is  inapplicable: 

a.  The  assumptions/requirements  of  the  2-way  ANOVA  are  considered 
unrealistic  or  sure  found  not  to  hold  for  the  data. 

b.  It  is  preferable  to  avoid  making  the  assumptions  or  testing  the 
requirements  of  the  2-way  ANOVA  and,  thus,  give  greater  generality  to  the 
conclusions  drawn  (at  the  expense  of  employing  a  less  powerful  test) . 

2.  The  observations  from  the  k  matched  samples  sure  in  at  least  an 
ordinal  scale. 

Discussion/methodology : 

The  procedures  for  application  of  the  Friedman  2-way  ANOVA  by  ranks 
test  involves  these  steps: 

1.  Cast  the  observations  (scores)  into  a  2-way  table  having  k  columns 
(conditions),  and  n  rows  (individual  subjects  or  matched  groups  of  subjects) . 

2.  Rank  the  observations  (scores)  in  each  row  from  1  to  k.  It  is 
immaterial  whether  the  ranking  is  from  lowest  to  highest  or  from  highest  to 
lowest;  however,  the  ranking  must  be  consistently  applied  across  all  n  rows. 

3.  Determine  the  sum  of  the  ranks  in  each  column,  1 ^ . 

2 

4.  Determine  the  value  of  the  test  statistic,  X  ranks  or  X|,  when  no 
tied  ranks  are  permitted: 


where  n  ■  number  of  rows  (subjects) 

k  -  number  of  columns  (conditions) 

Rj  ■  sum  of  ranks  in  jth  column 
k 

Z  directs  the  summation  of  the  squares  of  the  sums 

j«l  of  the  ranks  over  all  k  conditions 


2  2 

S.  Determine  the  critical  value  for  the  test  statistic,  X  ,  or  X 

critical  c 


A  —  A  .  . 

c  a,  k-1 

2  2 

6.  Resolve  the  hypothesis  test:  if  X  >  X  ,  reject  H  . 

r  c  o 


EXAMPLE  22-7 


Four  chief  master  sergeants  are  finalists  in  the  competition  for  AFLC 
Senior  Superintendent  of  the  Year.  After  reviewing  their  military  service 
records  and  personally  interviewing  them,  a  panel  of  five  officers,  serving 
as  judges  for  the  competition,  ranked  the  finalists.  The  ranks  assigned 
to  the  finalists  by  the  officer- judges  were  as  follows: 


1 
2 

Officer-Judges  3 

4 

5 

Do  the  five  officer-judges  agree  in  their  rankings  of  the  four  finalists? 
Test  at  o  ■  0.10.  If  the  officer- judges  agree,  what  is  their  overall 
(conqposite)  rank  order  of  the  four  finalists? 

Hq  -  There  is  no  agreement  among  the  five  officer-judges  in  their 
rankings  of  the  four  finalists . 

H]_  -  There  is  agreement. 

The  sums  of  the  ranks  in  the  columns,  R j ,  are: 


n 

Rj  «  Z  R^j  n  ■  5;  j  -  1,  ...»  k>  k  «  4 

J  i-1  13 

Rx  -  3  +  1  +  1  +  3  +  3  +  4-14 

CCMSgt  A) 

R2  -  7 

(CMSgt  8) 

R3  -  11 

(CMSgt  C) 

R4  -  18 

(CMSgt  D) 

Finalists 

CMS  crt  A  CMSqt  B  CMSqt  C  CMS  at  D 


3 

1 

2 

4 

1 

2 

4 

3 

3 

1 

2 

4 

3 

2 

1 

4 

4 

1 

2 

3 
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-  3n(k+l) 


C  "  1 — “ -  2  CRJ 

r  r  k.  OtflL  j«i  J 


12 


(5)  C4)  (5) 


< 


<14)2C7)2+  (ll)2  +  (18) 2 


f 


(3)  (5)  (5) 


(0.12) (690)  -  75  -  82.8  -  75 
7.8 


Xc  "  Xa,k-1  "  X0 .10,3  "  6,25 


x2  -  7.8  >  6.25  -xx2 
r  c 


Conclusion:  Reject  Hq.  Conclude  that  there  is  agreement  among  the  five 
of fleer- judges  In  their  rankings  of  the  four  finalists,  i.e. ,  the  five 
ranks  assigned  by  the  officer-judges  to  each  of  the  four  finalists  were 
drawn  from  different  populations,  at  the  a  -  0.10  level  of  significance. 


The  average  ranks,  R ,  where 


j' 


R  - 

j  n 


“  n  E  RH 

n  i-1 


or,  equivalently,  the  sums  of  the  ranks,  Rj,  may  now  themselves  be  arranged 
in  ascending  order  to  evidence  the  panel's  overall  (composite)  rank  order. 


Overall  (composite) 
Rank 

Finalist 

h 

1st 

CMSgt  B 

1.4 

7 

Lowest 

2nd 

CMSgt  C 

2.2 

11 

3rd 

CMSgt  A 

2.8 

14 

4th 

CMSgt  D 

3.6 

18 

Highest 

Reference (s) 

1.  Siegel,  pp.  166-172. 

2.  Winer,  B.  J.,  Statistical  Principles  in  Experimental  Design,  McGraw- 
Hill,  New  York,  1962,  pp.  136-137. 

Other  Test(s) 

1.  For  certain  cases  in  which  n  and  k  are  not  large,  the  exact  proba¬ 
bilities  (  prob- values)  of  occurrence  are  tabulated  for  values  of  the  test 
statistic  X2  ranks  as  large  as  an  observed  (computed)  X2,  see  Siegel. 


2.  For  the  determination  of  an  appropriate  test  statistic  when  tied 
ranks  are  permitted,  see  Winer,  pp.  136-137. 

3.  For  Cochran  Q  test  for  k  related  samples,  an  extension  of  the 
McNenar  test  for  two  related  samples,  provides  a  method  to  test  whether 
three  or  more  matched  sets  of  frequencies  or  proportions  (samples)  are  sta¬ 
tistically  significantly  different  among  themselves.  The  matching  may 
again  be  based  on  either  the  fact  that  the  same  individual  subjects  are 
employed  under  different  conditions  or  on  relevant  characteristics  of  dif¬ 
ferent  subjects  by  which  matched  groups  may  be  defined.  The  Cochran  test 
is  particularly  suitable  when  the  observations  are  in  a  nominal 
(categorical)  scale  or  constitute  dichotomized  (two- valued)  ordinal  infor¬ 
mation,  e.g.,  a  manager's  preference  between  two  alternative  decisions 
and/or  courses  of  action.  See  Siegel,  pp.  161-166. 

B.  Independent  Samples  Tests  of  Central  Tendency 

A  Two- Sample  Test:  (Wilcoxon) -Mann-Whitney  Test. 

General  description/purpose :  The  (Wilcoxon) -Mann-Whitney  test  is 
designed  to  assist  in  determining  whether  the  distributions  of  two  indepen¬ 
dent  populations  are  the  same  or  different.  If  it  is  assumed  that  any  dif¬ 
ference  between  the  two  population  distributions  is  due  only  to  a  dif¬ 
ference  in  location  of  the  two  distributions,  then  the  (Wilcoxon) -Mann- 
Whitney  test  may  be  used  to  test  whether  a  statistically  significant  dif¬ 
ference  exists  between  the  centred,  tendencies  of  the  two  distributions,  as 
reflected  by  their  medians. 

Assumptions/limitations : 

1.  The  usual  parametric  technique  for  analyzing  data  from  two  indepen¬ 
dent  samples,  from  populations  whose  variances  are  assumed  equal,  for  a 
difference  in  central  tendencies  is  the  application  of  the  t-test  to  the 
difference  of  their  means.  There  are,  however,  circumstances  under  which 
the  two  independent  samples  t-test  is  inapplicable: 

a.  The  assumpticns/requirements  of  the  two  independent  samples  t- 
test  are  considered  unrealistic  or  are  found  not  to  hold  for  the  data. 

b.  It  is  preferable  to  avoid  making  the  assumptions  or  testing  the 
requirements  of  the  two  independent  samples  t-test  and,  thus,  give  greater 
generality  to  the  conclusions  drawn  (at  the  expense  of  employing  a  less 
powerful  test) . 

2 .  For  nA  observations ,  X.  ,  from  population  A  and  nB  observations , 

XBl,  from  population  B:  1 

a.  The  measurement  scale  for  the  XA*  and  the  XB s  is  at  least 


ordinal 


b.  XA^,  i  *  1,  . . . ,  nA  and  Xg^ ,  i  ■  1,  . . . ,  nB  constitute  independent 

random  samples .  x 

c.  The  sample  randan  variables,  amd  XQ,  are  continuous. 

Discussion/methodology.  The  procedure  for  application  of  the  (Wilcoxan)- 
Mann- Whitney  test  involves  these  steps: 

1.  Arrange  in  ascending  order  the  nA  +  nB  observations  from  the  com¬ 
bined  samples . 

2.  Assign  the  ranks  1,  nA  +  nB  to  the  ordered  observations.  If 

ties  occur,  average  the  ranks  of  the  observations  involved  in  the  tie  amd 
assign  the  average  as  the  rank  of  each  tied  value. 

3.  Determine  the  sum  of  the  rarnks  assigned  to  the  XAi,  SA,  and  the  stun 
of  the  ramks  assigned  to  the  XB^,  S0: 

nA 

S_  -  E  R(XaJ 
A  i-1  Ai 

where  R(XA^)  is  the  rank  assigned  to  the  observation  XA^,  amd 
nB 

S®  ml  R(XBi) 


similarly. 

(Note:  As  a  computational  check: 

4.  Determine  the  value  of  the 
a.  If  a  two-tailed  test  T 


=  (nA  +  nB)  (nA  +  nB  +  1) 
SA  B  =  2 

test  statistic,  T: 

■  either  T,  or  T  ,  where 

A  o 


‘B 


H  '  [^T^\ 

Sa_[ 


b.  If  a  one-tailed  test,  T  -  whichever  of  TA  or  TB  (as  defined 
above)  corresponds  to  the  population  with  the  smaller  mediam  as  hypothe¬ 
sized  under  H^,  i.e.,  (1)  if  Hj_:  median  (XA)  <  median  (XB) ,  T  -  TA. 

(2)  If  H^:  median  (XA)  >  mediam  (XB) ,  T  ■  TQ. 
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5.  Determine  the  critical  value (s)  for  the  test  statistic,  W 


of 


critical 


a.  If  a  two-tailed  test  at  the  a- level  of  significance. 


W, 


W 


a/2 


W 


l-a/2 


(lower  tail  critical  value 
from  a  table,  and 

(upper  tail  critical  value) 


nAnB  a/2 


b.  If  a  one-tailed  test  at  the  a-level  of  significance,  V»c  = 

(lower  tail  critical  value)  from  a  table. 

b.  Resolve  the  hypothesis  test: 

a.  If  a  two-tailed  test,  if  either  T  <  or  T  >  gy  >  reject 

V  ’  "/2 

b.  If  a  one-tailed  test,  if  T  <  Wa,  reject  Hq. 

EXAMPLE  22-8 


A  chief  of  maintenance  desires  to  know  whether  a  difference  exists 
between  the  times  to  failure  of  a  certain  electronics  component  from  two 
competing  manufacturers.  From  field  tests,  times  to  failure,  in  hours, 
of  the  component  from  the  two  manufacturers,  A  and  B,  were  obtained  as 
follows : 

A:  76,  104,  125,  73,  87,  69,  149,  78,  101 

B:  97,  88,  92,  101,  83,  71,  90,  94,  91,  85,  111 

Do  the  average  times  to  failure  of  the  component  differ  between  the  two 
manufacturers?  Test  at  a  ■  0.05. 

X^  *  random  variable,  time  to  failure  in  hours  of  the 
component  from  manufacturer  A. 

XQ  -  similarly  defined 

Hq:  median  (XA)  «  median  (X^) 

:  median  (Xft)  f  median  (Xg) 

The  rank-ordered,  ascending  arrangement  of  the  combined  samples,  with  ties 
denoted  by  an  overbar  (-) ,  is: 
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Rank: 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

Manufacturer : 

A 

B 

A 

A 

A 

B 

B 

A 

B 

B 

Observation: 

69 

71 

73 

76 

78 

83 

85 

87 

88 

90 

Rank: 

11 

12 

13 

14 

15.5 

15.5 

17 

18 

19 

20 

Manufacturer : 

B 

B 

B 

B 

A 

B 

A 

B 

A 

A 

Observation: 

91 

92 

94 

97 

t101 

101 

104 

111 

125 

149 

These  hies  assigned  average  of  tied  ranks. 


Sft  *  E  R(X  ) 
A  i-1  Ai 


nA"9 


1  +  3  +  4  +  ...  +  20  *  92.5 


E  RCXg.)  nB  -  11 

i=l  i 

2+6+7+...+  18  «  117.5 


Checks  S*  +  Sn  -  92.5  +  117.5  -  210  * 
A  B 


{nA  +  V (nA  +  nB  +  1} 
2 


(20)  (21) 
2 


210  yes 


TA  '  SA  - 


-  92.5  - 

-  47.5 


nA(nA  *  1} 
2 

(9) (10) 


92.5  -  45 


Tb  -  SD  - 
B  B 


nB(nB  +  1) 


-  117.5  - 

-  51.5 


(ID  (12) 

2 


-  117.5  -  66 


Since  this  is  a  two-tailed  test,  T  •  either  T  or  T . 

A  B 


22-25 


Wo/2  "  W0.025;  nA  -  9,  -  11  “  24 

from  Table  (K) ,  page  276 
of  Siegel,  and 

Wl-a/2  “  w0.975;  nft  -  9,  nB  -  11 


n  -  W 
A  B  a/2 


-  (9) (11)  -  23  -  99  -  24  -  75 


If  t  ■  either  T  or  T  : 

A  B 


W  »  24  <  47.5  -  T  ,  51.5  -  T  <  76  -  W, 
a/2  A  B  l-a/2 


i.e.,  neither  T  <  W  nor  T  >  W. 

a/2 


l-a/2 


Conclusion:  Fail  to  reject  Hq.  There  is  insufficient  sample  evidence  to 
conclude  that  the  average  times  to  failure  in  hours  of  the  component  differ 
between  the  two  manufacturers  at  the  a  *  0.05  level  of  significance. 

Reference (s) .  Siegel,  pp.  116-127. 

Other  test (s) : 

1.  For  large  samples,  expecially  for  nA»  and  nBs  beyond  the  ranges 
of  tabulated  critical  values,  there  is  a  normal  approximation,  with  a 
correction  for  excessive  numbers  of  ties;  see  Siegel,  pp.  121-126. 

2.  The  median  test  also  provides  a  method  to  test  whether  two  indepen¬ 
dent  populations  differ  in  central  tendencies  and  may  also  be  employed  when¬ 
ever  observations  are  in  at  least  an  ordinal  scale  of  measurement.  However, 
the  median  test  is  almost  always  less  powerful  them  the  (Wilcoxon)  -Mann- 
Whitney  test.  Additionally,  it  possesses  the  unattractive  feature  that 

its  power-efficiency  (the  percent  of  time  the  test  will  concur  in  the  con¬ 
clusion  of  rejection  of  the  null  hypothesis  with  the  parametric  t-test, 
when  both  are  applied  to  data  which  meet  the  requirements  for  analysis  with 
the  parametric  t-test)  decreases  as  the  sample  sizes  increase.  See  Siegel, 

pp.  111-116. 

A  k-Sample  Test:  Kruakal-Wallis  1-Way  ANOVA  by  Ranks  Test 

General  description/purpose:  The  Kruskal-Wallis  1-way  analysis  of 
variance  (ANOVA)  by  ranks  test  is  designed  to  assist  in  determining  whether 
the  distributions  of  k  independent  populations  are  the  same  or  different. 


22-26 


If  it  is  assumed  that  any  differences  among  the  k  population  distributions 
are  due  only  to  differences  in  location  of  the  k  distributions,  then  the 
Kruskal -Wallis  test  may  be  used  to  test  whether  statistically  significant 
differences  exist  among  the  central  tendencies  of  the  k  distributions,  as 
reflected  by  their  medians . 

Assumptions/limitations : 

1.  The  usual  parametric  technique  for  analyzing  data  from  k  indepen¬ 
dent  samples,  from  populations  whose  variances  are  assumed  equal,  for  dif¬ 
ferences  in  central  tendencies  is  the  application  of  the  single  factor, 
completely  randomized  design  analysis  of  variance  (1-way  ANOVA) .  There 
are,  however,  circumstances  under  which  the  1-way  ANOVA  is  inapplicable : 

a.  The  assumptions/requirements  of  the  1-way  ANOVA  are  considered 
unrealistic  or  are  found  not  to  hold  for  the  data. 

b.  It  is  preferable  to  avoid  making  the  assumptions  or  testing  the 
requirements  of  the  1-way  ANOVA  and,  thus,  give  greater  generality  to  the 
conclusions  drawn  (at  the  expense  of  employing  a  less  powerful  test) . 

2.  For  n^  observations,  Xi^,  from  population  1,  n2  observations,  X2 . , 
from  population  2,  ...,  nk,  observations  Xk^,  from  population  k. 

a.  The  measurement  scale  for  the  Xj,  j  »  1,  . . . ,  k  is  at  least 

ordinal. 

b.  X1±,  i  ■  1,  ...»  nj_,  X2,,  i  *  1,  •••»  n2»  ...,  Xk  ,  i  =  1,  ...,  nk 
constitute  mutually  independent  random  samples. 

c .  The  sample  random  variables ,  X j ,  j  **  1 ,  . . . ,  k,  are  continuous . 
Discussion/methodology : 

The  procedure  for  application  of  the  Kruskal -Wallis  test  involves  these 
steps : 


1.  Arrange  in  ascending  order  the  ni  +  n2  +  • • •  +  nk  observations  from 
the  combined  samples. 

2.  Assign  the  ranks  1,  ...,  1*1  +  n2  +  ...  +nk  to  the  ordered  observa¬ 
tions.  If  ties  occur  average  the  ranks  of  the  observations  involved  in 
the  tie  and  assign  the  average  as  the  rank  of  each  tied  value. 

3.  Determine  the  sum  of  the  ranks  assigned  to  the  Xj^,  R]_,  the  sum  of 
the  ranks  assigned  to  the  X2,,  R2,  ...,  and  the  sum  of  the  ranks  assigned  to 
the  Xki,  Rk: 

nj 

R .  -  I  R(Xj . )  j  =  1,  ...»  k 

3  i-1 
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j. 


Three  types  of  wire  cord  are  being  considered,  with  which  to  secure  a 
certain  heavy  piece  of  aerospace  ground  equipment  to  cargo  loading  pallets 
for  transport  by  a  C-5.  Samples  of  the  three  types  of  wire  cord  were  sub¬ 
jected  to  a  test  of  breaking  strength.  The  measured  forced  (in  coded  units) 
required  to  break  the  wire  cords  were  as  follows: 


Type  1 
Type  2 
Type  3 


250,  220,  315,  260,  205 

185,  230,  215,  245 

260,  285,  245,  250,  320 


Do  the  average  breaking  strengths  differ  among  the  three  types  of  wire 
cord?  Test  at  a  *0.05. 


Xj  -  random  variable,  breaking  of  type  j  wire  cord,  j  «  1,  2,  3 


H  :  The  median  breaking  strengths  of  the  three  types  of  wire  cord 
are  equal. 


EL:  They  are  not  equal,  i.e.,*at  least  one  type  of  wire  cord  has 
a  greater  median  breaking  strength  than  the  others. 


The.  rank-ordered,  ascending  arrangement  of  the  combined  samples,  with  ties 
denoted  by  an  overbar  C-) ,  is: 


Rank: 

1 

2 

3 

4 

5 

6.5 

6.5 

Type: 

2 

1 

2 

1 

2 

2 

3 

Observation: 

185 

205 

215 

220 

230 

245 

245 

Rank: 

8.5 

8.5 

10.5 

10.5 

12 

13 

14 

Type 

1 

3 

1 

3 

3 

1 

3 

Observation 

250 

25Q 

260 

260 

285 

315 

320 

Ties  assigned  average  of  tied  ranks 


nl 

Ri  -  Z  R(XX  ) 
i-1  1 


n^  -  5 


2  +  4  +  8.5  +  10.5  +  13 


38 


n 


1 
Z 

i-1 

1+3+5+6.5-15.5 


*2  “  A  RtV 


n2  ’  4 


n3 


R.  -  S  R(X,  ) 
i-1  1 


n3  *  5 


6.5  +  8.5  +  10.5  +  12  +  14  -  51.5 


Check:  N  -  n^  +  +  n3 


-5+4+5-14 


R.  +  R_  +  R.  -  38  +  15.5  +  51.5  -  105  - 


T 


SLlfiil  -  iilUiSL  -  105  ye, 
12  *  j^Ri"  ft>VN+1>J2( 

N(N+1)  jf1  j  nj  j 


,  12  j  [38-  fo) (5) (15) I2 

(14)  (15)  |  -  5 

.  [15.5  -  (%)  (4) (15)]2 
4 

+  [51.5  -  ft)  (5) (15) l2 j 

-(0.057)  [(0.05)  +  (52.5625)  +  39.2)] 
-5.233 

•P  «  T  3  T 

c  W  —0.05;  k  -  3,  n^  ■  5,  4,  5 


nl  "  5»  n2  "  4r  n3  “  5 

but  the  order  of  the  sample  size,  nj  doe, 
not  affect  the  critical  value  determination 
from  a  table. 


Wo,  k  .  3,  „4  .  5,  5,  4  -  5'6429  trom  T'bl*  «  283  1 


T  —  5.233  <  5.6420  -  T 

c 

Conclusion:  Fail  to  reject  H  .  There  is  insufficient  sample  evidence  to 
conclude  that  the  average  breaking  strengths  differ  among  the  three  types 
of  wire  cord,  i.e. ,  that  at  least  one  type  of  wire  cord  has  a  greater 
average  breaking  strength  than  the  others,  at  the  a  —  0.05  level  of  signi¬ 
ficance. 

Reference (s) .  Siegel,  pp.  184-193. 
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Other  test (s) 


1.  There  is  a  correction  for  excessive  numbers  of  ties;  see  Siegel, 
pp.  191-192. 

2.  The  extension  of  the  median  test  also  provides  a  method  to  test 
whether  k  independent  populations  differ  in  central  tendencies  and  may  also 
be  employed  whenever  observations  are  in  at  least  an  ordinal  scale.  How¬ 
ever,  the  extension  of  the  median  test  is  in  essence  a  X2  test  for  k  samples 
which  involves  a  2  x  k  contingency  table  and,  hence,  suffers  from  the  limits- 
tins  of  contingency  table  cell  size  criteria)  See  Siegel,  pp.  179-184. 


C.  Measures  and  Tests  of  Correlation. 

The  Two— Sample  Case:  Spearman  Rank  Correlation  Coefficient  Test 

General  description/purpose.  The  Spearman  rank  Correlation  coefficient 
test  may  be  used  to  test  whether  a  statistically  significant  correlation 
(linear  association)  exists  between  the  paired  values  from  two  processes. 

In  the  case  of  the  existence  of  a  statistically  significant  correlation, 
the  Spearman  rank  correlation  coefficient  may  be  used  as  a  measure  of  the 
extent  of  the  correlation  (degree  of  the  linear  association) . 

Assumptions/limitations : 

1.  The  usual  parametric  technique  for  analyzing  data  from  two  related 
samples  for  correlation  is  the  application  of  the  t-test  to  the  Pearson 
produce-moment  correlation  coefficient.  There  are,  however,  circumstances 
under  which  the  correlation  t-test  is  inapplicable: 

a.  The  assumptions/requirements  of  the  correlation  t-test  are 
considered  unrealistic  or  are  found  not  to  hold  for  the  data. 

b.  It  is  preferable  to  avoid  making  the  assumptions  or  testing 
the  requirements  of  the  correlation  t-test  and,  thus,  give  greater  gener¬ 
ality  to  the  conclusions  drawn  Cat  the  expense  of  employing  a  less  powerful 
test) . 

2.  For  n  pairs  of  observations,  (X^,  Y^l : 

a.  The  measurement  scale  for  the  Xs  and  the  Ys  is  at  least 

ordinal. 

b.  The  pairs,  (Xj.,  Y^) ,  i  ■  1,  ...,  n  constitute  a  random  sample 
from  a  bivariate  population  distribution  of  continuous  random  variables  x 
and  Y. 

Discussion/methodology : 

The  procedure  for  application  of  the  Spearman  rank  correlation  coef¬ 
ficient  test  involves  these  steps: 
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1.  Assign  the  ranks  1,  n  in  ascending  order  to  the  observations 

of  each  sample  separately.  If  ties  occur,  average  the  ranks  of  the  obser¬ 
vations  involved  in  the  tie  and  assign  the  average  as  the  rank  of  each 
tied  value. 

2.  For  each  pair  of  observations,  (X*,  ,  determine  the  difference 

between  the  ranks  of  the  component  observations,  dj.: 

di  -  ROC*)  -  RtY*) 

where  R(Xj.)  is  the  rank  assigned  to  the  observation  Xj,  and  R(Y^) 
similarly. 

3.  Determine  the  value  of  the  Spearman  ranlt  correlation  coefficient, 
rs,  which  is  the  test  statistic! 


4.  Determine  from  a  table  the  critical  value  for  the  test  statistic, 
'critical  or  rc5 

if  a  two-tailed  test  at  the 
a-level  of  significance 

if  a  one-tailed  test  at  the 
a-level  of  significance 

5.  Resolve  the  hypothesis  test:  if  |rB|  >  rQ,  reject  Rq. 

EXAMPLE  22-10 

While  undergoing  Basic  Military  Training  (BMT) ,  enlistees  are  given 
both  verbal  skills  and  quantitative  skills  tests  to  assist  in  the  selection 
of  their  career  fields  of  assignment.  To  determine  whether  there  is  a 
correlation  (linear  association)  between  the  verbal  skills  and  the  quanti¬ 
tative  skills  evidenced  by  enlistees  tested  during  BMT,  ten  enlistees 
were  randomly  selected,  but  without  their  knowledge.  Their  scores  on  the 
two  skills  tests  were  as  follows: 
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Verbal  skills 

Quantitative  skills 

Enlistees 

test  score 

test  score 

1 

90 

70 

2 

100 

60 

3 

75 

60 

4 

80 

80 

5 

60 

75 

6 

75 

90 

7 

85 

100 

8 

40 

75 

9 

95 

85 

.0 

65 

65 

Does  there  exist  a  correlation  (linear  association)  between  the  verbal 
skills  and  the  quantitative  skills  evidenced  by  enlistees  tested  during 
BMT?  Test  at  a  ■  0.05. 

X  “  random  variable,  verbal  skills  test  score  of  an  enlistee 
tested  during  BMT 

Y  “  random  variable,  quantitative  skills  test  score... 


V  PXY 


(X  and  Y  are  uncorrelated) 


H. :  p  0  (X  and  15  are  correlated,  i.e.,  either  there  is  a 
tendency  for  larger  values  of  x  to  be  paired  with 
larger  values  of  Y  (direct  correlation) ,  or  there 
is  a  tendency  for  smaller  values  of  x  to  be  paired 
with  larger  values  of  Y  (indirect  or  inverse 
correlation) . 

The  ranked  observations  and  the  differences  of  these  ranks,  d^,  are: 


Enlistee 


R(Y* ) 


The  value  of  the  Spearman  rank  correlation  coefficient,  rs,  is: 


r 

s 


6  2 
1  *  ^  di 


n  ■  10 


-{ 


1  '  i (1577997  E<4)2  +  (8*5)2  +  •**  +  (0)2]  } 


-Cf— 1 

[990  j 


(161) 


«  0.0242 


rc  “  ro/2  "  *0.025,  n-10  *  °‘6364  from  Table  22‘2 

|r  I  as  0.0242  <  0.6364  -  r 
1  s  1  c 


Conclusion:  Fail  to  reject  HQ.  There  is  insufficient  sample  evidence  to 
conclude  that  there  exists  a  correlation  between  the  verbal  skills  and  the 
quantitative  skills  evidenced  by  enlistees  tested  during  BMT  at  the  a  «  0.05 
level  of  significance. 

Reference (s) .  Siegel,  pp.  202-213. 

Other  test (s) . 

1.  There  is  a  correction  for  moderate  or  greater  numbers  of  ties;  see 
Siegel,  pp.  206-210. 

2.  For  large  samples,  especially  for  n  beyond  the  range  of  tabulated 
critical  values,  there  is  a  t-test  approximation;  see  Siegel,  pp.  212. 

3.  The  Kendall  rank  correlation  coefficient  test  also  provides  a 
method  to  determine  whether  the  paired  values  of  two  processes  are  corre¬ 
lated  and  may  also  be  employed  whenever  observations  are  in  at  least  an 
ordinal  scale  of  measurement.  The  Kendall  test  enjoys  a  preference  over 
the  Spearman  test  for  excessive  numbers  of  ties.  Since  both  tests  utilize 
the  same  amount  and  kind  of  sample  information  (ranks) ,  they  are  equally 
powerful  in  rejecting  Hq  and  thereby  detecting  the  existence  of  correlation 
(linear  association).  The  Kendall  rank  correlation  coefficient,  t,  and  the 
Spearman  rank  correlation  coefficient,  r8,  have  different  underlying  scales, 
however.  When  computed  from  the  same  data,  they  will  generally  differ 

in  numerical  value;  they  are  not  directly  comparable  to  each  other.  The 
Kendall  test  has  an  advantage  over  the  Spearman  test  in  that  t  can  be 
generalized  to  a  partial  correlation  coefficient.  See  Siegel,  pp.  223-229. 
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Table  22-2 


Critical  Values  of  the  Spearman  Test  Statistic 


n 

a*. 100 

.050 

.025 

.010 

.005 

.001 

4 

.8000 

.8000 

5 

.7000 

.8000 

.9000 

.9000 

6 

.6000 

.7714 

.8286 

.8857 

.9429 

7 

.6786 

.9643 

8 

.5000 

.6190 

.7143 

.8095 

.8571 

.9286 

9 

.4667 

.5833 

.6833 

.7667 

.8167 

.9000 

10 

.4424 

.5515 

.6364 

.7333 

.7818 

.8667 

11 

.4182 

.5273 

.6091 

.7000 

.7455 

.8364 

12 

.3986 

.4965 

.5804 

.6713 

.7273 

.8182 

13 

.3791 

.4780 

.5549 

.6429 

.6978 

.7912 

14 

.3626 

.4593 

.5341 

.6220 

.6747 

.7670 

15 

.3500 

.4429 

.5179 

.6000 

.6536 

.7464 

16 

.3382 

.4265 

.5000 

.5824 

.6324 

.7265 

17 

.3260 

.4118 

.4853 

.5637 

.6152 

.7083 

18 

.3148 

.3994 

.4716 

.5480 

.5975 

.6904 

19 

.3070 

.3895 

.4579 

.5333 

.5825 

.6737 

2Q 

.2977 

.3789 

.4451 

.5203 

.5684 

.6586 

21 

.2909 

.3688 

.4351 

.5078 

.5545 

.6455 

22 

.2829 

.3597 

.4241 

.4963 

.5426 

.6318 

23 

.2767 

.3518 

.4150 

.4852 

.5306 

.6186 

24 

.2704 

.3435 

.4061 

.4748 

.5200 

.6070 

25 

.2646 

.3362 

.3977 

.4654 

.5100 

.5962 

26 

.2588 

.3299 

.3894 

.4564 

.5002 

.5856 

27 

.2540 

.3236 

.3822 

.4481 

.4915 

.5757 

28 

.2490 

.3175 

.3749 

.4401 

.4828 

.5660 

29 

.2443 

.3113 

.3685 

.4320 

.4744 

.5567 

3Q 

.2400 

.3059 

.3620 

.4251 

.4665 

.5479 

The  k-Sample  Case:  Kendall  Coefficient  of  Concordance  Teat 

General  description/purpose:  The  Kendall  coefficient  of  concordance 
test  may  be  used  to  test  whether  a  statistically  significant  association 
exists  among  the  matched  values  from  k  processes.  In  the  case  of  the 
existence  of  a  statistically  significant  association,  the  Kendall  coef¬ 
ficient  of  concordance  may  be  used  as  a  measure  of  the  degree  of  the  asso¬ 
ciation.  The  values  of  the  k  variables  are  measured  in,  or  trams formed  to, 
rauiks.  When  observations  consist  of  n  sets  of  matched  values  from  the  k 
variables,  the  Kendall  coefficient  of  concordance  test  is  equivalent  to  the 
Friedman  2-Way  ANOVA  by  rauiks  test  with  the  roles  of  k  and  n  reversed. 

Assumptions/limitations : 

Same  as  for  the  Friedmaui  test  with  the  roles  of  k  and  n  reversed.  See 
page  22-18. 

Discussion/methodology : 

The  procedure  for  application  of  the  Kendall  coefficient  of  concordamce 
test  involves  these  steps: 

1.  Cast  the  observations  into  a  2-way  table  having  n  columns  (sets  of 
matched  values,  or  entities  to  be  ramked)  and  k  rows  (processes,  or  judges 
assigning  ranks) . 

2.  Rank  the  observations  in  each  row  from  1  to  n.  It  is  immaterial 
whether  the  ramking  is  from  lowest  to  highest  or  from  highest  to  lowest, 
however,  the  ranking  must  be  consistently  applied  across  all  k  rows. 

3.  Determine  the  sum  of  the  rauiks  in  each  column,  Rj,  and  the  mean  of 
these  ramk  sums ,  Rj : 


n 


4.  Determine  the  value  of  the  Kendall  coefficient  of  concordamce,  W, 
when  the  proportion  of  ties  is  not  large : 


(l/12)k2(nJ-n) 


where 


sum  of  squares  of  the  observed  deviations  from  the 
mean  of  the  Rj. 


k  «  number  of  rows  (sets  of  rankings,  or  judges) 

n  "  number  of  columns  (entities-objects  or  individuals -ranked) . 

12  3 

(yj)k  (n  -n)  ■  maximum  possible  sum  of  the  squared  deviations, 
i.e.,  the  sum  s  which  would  occur  with  perfect 
agreement  among  the  k  sets  of  rankings. 

Roughly  speaking,  H  is  an  index  of  the  divergence  of  the  actual  agreement 
evidenced  by  the  observations  from  the  maximum  possible  (perfect)  agreement 

2 

5.  Determine  the  value  of  the  test  statistics,  X^  ; 

Xw2  -  k(n-l)W 
2  2 

When  no  ties  are  permitted,  XM  *  X  zanJiB°*  the  Friedman  2-way  ANOVA  by 
ranks  test. 

2 

6.  Determine  the  critical  value  for  the  test  statistic,  X  . ^ .  ,  or 

_  critical 


A  "A  . 

c  a,  n-1 

2  2 

7.  Resolve  the  hypothesis  test;  If  X  >  X  ,  reject  H  . 

w  c  o 

EXAMPLE  22-11 

The  example  first  resolved  with  the  Friedman  2-way  ANOVA  by  ranks  test 
is  not  approached  as  a  Kendall  coefficient  of  concordance  test  application. 


CMSgt  A 


FINALISTS 

CMSgt  B  CMSgt  C 


CMSgt  D 


Of f icer- j  udges 


R3-ll 


R.-18 

4 


H  :  There  is  no  agreement  among  the  five  officer-judges  in  their 
rauikings  of  the  four  finalists. 

H. :  There  is  agreement. 


n 


n  =  4 


14  +  7  +  H  +18 


*■  12.5 


Z  (Rj  -  R.)2 
j-1  3  3 


(14  -  12.5)  +  (7  -  12. 5)2  +  (11  -  12. 5)2  +  (18 


2.25  +  30.25  +  2.25  +  30.25  -  65 


(1/12) k* (n^-n) 


k  *  5,  n  «  4 


(1/12)  (5^)  L  (4)  -*-4 


65 _ 

(1/12)  (25)  (60) 


k(n-l)W 


(5) (4-1) (0.52) 


ranks 


X  2  .  X2 
c  o 


a,n-l  0.10,3 


X  2  -  7.8  >  6.25  -  X  2 


Conclusion:  Reject  H  ,  etc. 

o 

Reference (s) .  Siegel,  pp.  229-238 


Other  test(s). 


1.  For  certain  cases  in  which  n  is  not  large,  the  critical  values  of 

s  are  tabulated  for  selected  k  and  levels  of  significance .  Table  (R) ,  page 
286  of  Siegel,  includes  the  cases  for  n  ■  3  to  7,  selected  k  between  3  and 
20,  and  a  =  0.01  and  0.05. 

2.  There  is  a  correction  for  excessive  numbers  of  ties;  see  Siegel, 
pp.  233-235. 


GOODNESS  OF  FIT  (GOF)  TESTS 

A.  One-Sample  Tests  Against  an  Hypothesized  Distribution. 

Chi  Square  GOF  Test 

General  description/purpose:  The  chi  square  GOF  test  may  be  used  to 
test  whether  (a)  statistically  significant  difference (s)  exist (s)  between 
an  empirical  distribution  and  an  hypothesized  theoretical  distribution, 
either  discrete  or  continuous,  the  parameter (s)  of  which  may  be  pre¬ 
specified  or  estimated  from  sample  observations.  Hie  test  is  an  applica¬ 
tion  of  a  chi  square  row  vector. 

Assumptions/limitations  j 

1.  The  discussion  on  page  22-1  applies. 

2.  When  the  hypothesized  theoretical  distribution  is  discrete,  each 
value  of  that  random  variable  may  constitute  a  cell.  However,  since  the 
test  must  meet  the  cell  size  criteria  of  a  chi  square  row  vector,  some 
combining  of  adjacent  cells  may  be  necessary. 

3.  When  the  hypothesized  theoretical  distribution  is  continuous, 
arbitrary  cells  must  be  created  by  the  investigator.  Each  cell  consists 
of  a  range  of  values  which  that  random  variable  can  take  on.  Hie  choices 
of  upper  and  lower  limits  for  each  cell  are  somewhat  arbitrary  and  require 
that  subjective  determinations  be  made  by  the  investigator. 

Discussion/methodology : 

Hie  discussion  on  page  22-1  applies,  except  that: 

1.  df  -  k  -  1  -  NEP 

Where  k  -  number  of  cells 

NEP  -  number  of  estimated  parameters. 

.  v2  v  2  v2 

2.  X  ...  ,  or  X  "X  ... 

critical  c  a,df 

2  2 

3.  Hie  test  is  a  one-tailed  test  to  the  right;  i.e.,  if  X.  >  X  , 

reject  hq.  °  c 


EXAMPLE  22-12 

(Discrete  distribution) .  The  chief  of  the  base  pharmacy  believes  that 
customers  arrive  for  service  in  accordance  with  a  Poisson  process.  To  test 
his  belief,  the  number  of  arrivals  was  observed  for  710  randomly  selected 
5  minute  periods,  as  follows: 

No.  of  arrivals 
within  5  minute 

time  period:  012  34567  or  more 

No.  of  time 

periods  observed:  50  100  150  155  110  65  60  20 

Do  these  data  support  the  base  pharmacy  chief's  belief  that  customer's 
arrivals  are  Poisson?  Test  at  a  »  0.05. 

X  =  random  variable,  the  number  of  arrivals  within  a  five  minute 
time  period  at  the  base  pharmacy. 


Hq  *  X  ^  Poisson  (X  ■  3.0) 

*  X  /  Poisson  (X  -  3.0)  X  *  X,  estimated  from  sample 


NO.  Of 

Arrivals 
within  5 
minute 
time  period 
(X) 

0 

1 

2 

1 

3 

4 

5 

6 

7  or 

more 

Total 

No.  of 
time 
periods 
observed 

icV 

50 

100 

150 

155 

110 

65- 

60 

20 

710-n 

P (X-i) -Pi 

0.05 

0.149 

0.224 

0.224 

0.168 

0.101 

0.05 

0.034 

| 

No.  of 

time 

periods 

expected 

(Ei-pi*n) 

36 

106 

159 

159 

119 

72 

36 

24 

711 
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-  i-0 


Z  X±f± 


(0)  (50)  +  (1)  (100)  +  (2)  (150)  +  ...  +  (7)  (20) 


*3.0  where  -  0^ 


2  2  7  (0i"V 

K  <or  *  >  -  z  ------ 

°  8  i-0  Ei 


(20-24) 

...  +  24"-  *  24.47 


(50-36) 2  (100-106) 2  (150-159) 2 

36  106  159 


df  -  (no.  of  cells)  -  1  -  (no.  estimated  parameters)  -  k  -  1  -  NEP 


8  -  1  -  1  «  6 


Y  2  y2 

c  "  0.05,  6 


X2  Jt  24.47  >  12.6  -  X  2 
o  c 


Conclusion:  Reject  Hq.  Customer  arrivals  at  the  base  pharmacy  are  not 
distributed  Poisson  (  X  -  3.0)  at  the  5%  level  of  significance. 

(Notef:  Since  X  -  3.0  was  estimated  from  the  observations,  this  value  is 
the  "best"  choice  for  the  parameter  X  df  a  candidate-Poisson.  Consequently 
as  Poisson  (X-3.0)  was  rejected,  no  other  Poisson  distribution  should  be 
expected  to  evidence  a  goodness  of  fit. 

EXAMPLE  22-13 

(Continuous  distribution) .  It  is  suspected  that  the  scores  of  USAF 
enlistees  on  a  particular  aptitude  test  administered  during  Basic  Military 
Training  (BMT)  are  normally  distributed.  The  scores  of  100  randomly 
selected  enlistees  were  as  follows  (blocked  data) : 


Score  on  a  particular 
aptitude  test 


No.  of  enlistees 
who  achieved  score 
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X  -  i  Z  Mifi  -  ^  [(61)  (5)  +  (64)  (18)  +  (67)  (42)  (70)  (27)  + 


(73) (8)]  *  67.45  where  f±  -  0i 


2 

s 


"  n^I  1  (Mi  "  X)2fi  *  '(100-lT  ^C61J  ~  67-45i2(5>  +  <64  -  67.45)  2(18) 
+  ...  +  (73  -  67.45) 2 (8) ]  S  8.526  ■  (2.92)2 


vx 


Bi  3 


i  SO 


B. 


62.5-67.45  _ 

2.92 


-1.7,  etc. 


X20  (or  X2g) 


k 

E 

i=*l 


E. 


«  0.681 


df  ■  k  -  1  -  NEP  -5-1-2-2 

’i  -  <05.  2  -  5'" 

x2  0.681  <  5.99  -  x2 
o  c 

< 

Conclusion.  Fail  to  reject  H0.  There  is  insufficient  sample  evidence  to 
conclude  that  the  scores  of  enlistees  on  a  particular  aptitude  test  admin¬ 
istered  during  BMT  are  not  distributed  normal  (y  *  67.45,  a2  *  (2.92) 2)  at 
the  5%  level  of  significance.  (See  accompanying  illustration  in  Figure 
22-1. 

Reference (s) .  Siegel,  pp.  42-47. 

Kolmogorov- Smirnov  (K-S)/Lilliefors  One-Sample  Tests 

General  description/purpose:  The  Kolmogorov-Smimov/Lilliefors  one- 
sample  GOF  tests  may  be  used  to  test  whether  (a)  statistically  significant 
difference (s)  exist (s)  between  an  empirical  distribution  and  an  hypothe¬ 
sized  theoretical  distribution  with  a  continuous  cumulative  distribution 
function,  the  parameter (s)  of  which  should  be  pre-specified. 


22-43 


Assumptions/limitations : 

1.  The  K-S  test  is  designed  for  the  conduct  of  a  goodness  of  fit  test 
of  an  empirical  or  sample  distribution  against  an  hypothesized  theoretical 
distribution  with  a  continuous  cumulative  distribution  function  (cdf) .  It 
may,  however,  be  applied  to  the  question  of  goodness  of  fit  against  an 
hypothesized  theoretical  distribution  with  a  discrete  cdf.  When  the  cdf 
is  continuous ,  the  level  of  significance  of  the  test  is  exactly  the  pre¬ 
specified  a-level;  when  the  cdf  is  discrete ,  the  test  is  conservative:  the 
true  level  of  significance  is  at  most  the  pre-specified  a-level.  (Gibbons: 
p.  85). 


2.  The  K-S  test  requires  that  all  parameters  of  the  hypothesized 
theoretical  distribution  be  specified  in  the  goodness  of  fit  hypothesis 
test.  If  any  parameters  are  estimated  from  the  sample  data,  the  exact 
distribution  of  the  K-S  test  statistic  max  D  -  Dn  is  no  longer  known.  When 
K-S  Dcritical  values  are  used,  the  resulting  test  is  extremely  conservative 
and  should  be  reaccomplished  with  the  Chi -Sepia re  test,  if  at  all  possible. 
There  are  two  noteworthy  exceptions:  when  the  hypothesized  theoretical 
distribution  is  normal  or  exponential,  the  parameters  may  be  estimated  from 
the  sample  data,  since  Lilliefors  has  tabulated  the  exact  Dcritical  va^ues 

(obtained  by  Monte  Carlo  calculations),  Lilliefors'  numerical  investiga¬ 
tions  indicate  that  Dn  provides  a  more  powerful  test  against  these  distri¬ 
butions  than  the  Chi-Square  when  his  revised  D  .  .  .  values  are  used. 

(Gibbons:  p.  87).  critical 


3.  Comparison  of  K-S  and  Chi-Square  tests: 


a.  One  of  the  primary  advantages  of  K-S  is  that  the  exact  distri¬ 
bution  of  Oh  is  known  and  tabulated,  whereas  the  distribution  of  Xq  is  only 
approximately  Chi-Square  for  any  finite  n.  (It  is  asymptotically  X2.) 


b.  Another  advantage  of  the  K-S  is  that  it  can  be  applied  for  any 
sample  size  n,  whereas  the  approximation  x£  4,  *  is  only  "safe"  for  large 
n.  Application  of  the  Chi-Square  requires  satisfaction  of  the  cell  size 
criteria;  whenever  cell  combination  is  required,  the  calculated  X2  is  no 
longer  exact,  since  it  is  affected  by  the  scheme  of  combination.  In 
contradistinction,  the  K-S  treats  (ungrouped)  individual  sample  observa¬ 
tions  separately  and,  consequently,  need  not  lose  information  through  the 
combination  of  cells.  The  principal  advantage  of  the  Chi-Square  is  that, 
when  the  parameters  of  the  hypothesized  theoretical  distribution  are  not 
specified,  they  can  be  estimated  from  the  saiqple  data,  and  a  goodness  of 
fit  test  can  be  conducted  in  the  usual  manner  by  sisqply  reducing  the  df 
of  X£. 


c.  Studies  of  the  power  functions  of  both  tests  have  been  reported 
in  the  literature  in  efforts  to  characterize  their  relative  performances. 
However,  these  do  not  seem  to  provide  a  definitive  basis  for  choice,  in 
general . 
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ferable  for  small  sample  sizes.  (Gibbons:  pp.  86-87.) 
Discussion/methodology : 


The  procedure  for  application  of  the  Kolmogorov- Smimov/Lilliefors  test 
involves  these  steps: 

1.  Specify  the  hypothesized  cumulative  distribution  function  F(X) 
including  all  its  parameters,  unless  a  Lilliefors  test  against  a  normal 
or  an  exponential  where  the  parameter  (s)  may  be  estimated  from  the  obser¬ 
vations. 


2.  Arrange  the  n  observations  in  a  sample  cumulative  distribution 
(step)  function  S(X) . 

3.  Pair  each  interval  of  S 00  with  the  comparable  interval  of  p(x) . 

4.  For  each  step  on  the  cumulative  distribution  functions,  determine 
the  absolute  difference (s)  between  SCX]  and  PCX). 

5.  Determine  the  value  of  the  test  statistic,  Dn:  i 

Dn  -  max  D  -  max  JfCX^  -  SCX^  |  i  »  1,  ...»  n 

6.  Determine  from  an  appropriate  table  the  critical  value  for  the 
test  statistic,  »axitLcal  or  Dc: 


7.  Resolve^  the  hypothesis  test:  if  D,t  >  Dc,  reject  Ho. 
Figure  22-2.  Construction  *: f  K-S/Lillief ors  Test  Statistics 


(Discrete  data).  A  dafaaaa  oontractor  contanda  that  tha  breakaway 
praasura  in  lha.  of  a  cartain  valve  ia  diatributad  N{32,  (1.8)2) .  To  taat 
thia  contention,  a  randcn  sample  of  10  breakaway  praaaoraa  waa  obtainad  aa 
follows »  31.4,  33.5,  31.0,  36-2,  33.3,  33.7,  34.9,  33.4,  37.0.  Do  thaaa 

data  support  tha  oontractor' a  contantion  that  tha  braakaway  preaaura  of  a 
cartain  valva  ia  diatributad  H(32,  (1.8)2)?  Taat  at  a  -  0.05. 


x  -  randan  variabla,  braakaway  praaaura  in  lba.  of  a  cartain  valva. 
Hq  -  X  *  11(32,  (1.8) 2) 

-  X  *  11(32,  (1.8)2) 


31.0 

-.56 

.2877 

31.4 

-.33 

.3707 

33.3 

+.72 

.7642 

33.4 

+.78 

.7823 

33.5 

+.83 

.7967 

33.7 

+.94 

.8264 

34.4 

+1.33 

.9082 

34.9 

+1.6 

.9452 

36.2 

+2.3 

.9893 

37.0 

+2.8 

.9974 

Juat  Bafore 
Diffaranoas 


S(Xi) 


Just  Aftar 
Differences 


.2877 

.2707 

.5642 

.4823 

.3967 

.3264 

.3082 

.2452 

.1893 

.0974 


.1877 

.1707 

.4642 

.3823 

.2967 

.2264 

.2082 

.1452 

.0893 

.0026 


D  -  D,  _ 
n  10 


|F(*t)  -  |  -  0.5642  i  -  1,  ...»  10 


Dc  ”  Da,n  "  ®0.05,  10 


0.409  fron  Table  22-3  (X-S  Test  Critical 
Values) 


0.5642  >  0.409  -  D 


Conclusion:  Reject  Hq..  The  braakaway  pressure  of  a  cartain  valva  is  not 
distributed  N(32,  (1.8) 2)  at  tha  5%  level  of  significance. 


is 


20 

.10 

.0! 

10 

.950 

.9 

14 

.776 

.8. 

15 

.636 

.71 

13 

.565 

.6; 

02 

.01 

990 

.995 

900 

.929 

785 

.829 

689 

.734 

627 

.669 

577 

.617 

538 

.576 

507 

.542 

480 

.513 

457 

.489 

437 

.468 

419 

.449 

404 

.432 

390 

.418 

377 

.404 

366 

.392 

355 

.381 

346 

.371 

337 

.361 

329 

.352 

321 

.344 

314 

.337 

307 

.330 

301 

.323 

295 

.317 

290 

.311 

284 

.305 

279 

.300 

275 

.295 

270 

.290 

266 

.285 

262 

.281 

258 

.277 

254 

.273 

251 

.269 

247 

.265 

244 

.262 

241 

.258 

238 

.255 

235 

.252 

Table  entries  are  critical  values  Wa  such  that  Prob  (W  ^  Wa).  -  a 

Source:  Adapted  from  L.  H.  Miller,  "Tables  of  Percentage  Points  of  Kolmogorov 

Statistic,”  Journal  of  the  American  Statistical  Assoc.  51  (1956) ,pp.  111-21. 


EXAMPLE  22-15 


(Blocked  data) .  Hie  normal  distribution  example  first  resolved  with 
the  Chi  square  GOF  test  is  now  approached  as  a  Lilliefors  test  application. 


X  ■  random  variable,  the  score  of  an  enlistee  on  a  particular 
aptitude  test  (integer  value) 


Hq:  X^N  (67.45,  (2.92)  ) 


>2) 
,  2. 


H^;  X  N  (67.45,  (2.92) “)  X  *  y,  s2  ■  a2,  estimated  from 

sample 


Class 

Class  Boundray 


i  Interval  B. 


B. 

1 


f£i 

n 


S(Xi)  F(X±) 


1  60-62 


2  63-65 


3  66-68 


4  69-71 


5  72-74 


— 0©  —00 


62.5  -1.70  l 


65.5 


l 

-0.67  ) 


68.5  0.36 


71.5  y.  39 


'tS 


42 


27 


5 

100 


23 

100 


65 

100 


92 

100 


100 

100 


n  -  100 


0.05  0.0446  0.0054 


0.23  0.2514  0.0214 


0.65  0.6406  0.0094 


0.92  0.9177  0.0023 


1.0  1.0 


3n  "  °100  "  max  lF(3V  _  S(V  I  "  0-0214 


VBi 

i  -  1,  ...»  5 

°c  *  Da,n  "  D0.05,100  *  °’0886 

from  Table  22-4  (Lilliefors  normal  test  critical  values) 


D  -  0.0214  <  0.0886  -  D 
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Conclusion.  Fail  to  reject  H  ,  etc. 

x  O 

Reference (s) 

1.  Gibbons,  Jean  Dickinson.  Nonparametric  Statistical  Inference. 
(McGraw-Hill:  New  York,  1971)  pp.  73-87. 

2.  Siegel,  pp.  47-52. 


Table  22-4 

Critical  Values  of  the  Lilliefors  Test  Statistic 


for  Jin  Unspecified 

Normal 

Distribution 

a-.  20 

.15 

.10 

.05 

.01 

Sample  size  n«  4 

.300 

.319 

.352 

.381 

.417 

5 

.285 

.299 

.315 

.337 

.405 

6 

.265 

.277 

.294 

.319 

.364 

7 

.247 

.258 

.276 

.300 

.348 

8 

.233 

.244 

.261 

.285 

.331 

9 

.223 

.233 

.249 

.271 

.311 

10 

.215 

.224 

.239 

.258 

.294 

11 

.206 

.217 

.230 

.249 

.284 

12 

.199 

.212 

.223 

.242 

.275 

13 

.190 

.202 

.214 

.234 

.268 

14 

.183 

.194 

.207 

.227 

.261 

15 

.177 

.187 

.201 

.220 

.257 

16 

.173 

.182 

.195 

.213 

.250 

17 

.169 

.177 

.189 

.206 

.245 

18 

.166 

.173 

.184 

.200 

.239 

19 

.163 

.169 

.179 

.195 

.235 

20 

.160 

.166 

.174 

.190 

.231 

25 

.149 

.153 

.165 

.180 

.203 

30 

.131 

.136 

.144 

.161 

.187 

Over  30 

.736 

.768 

.805 

.886 

1.031 

~nr 

~^T 

& 

Table  entries  are 

critical  values  Wa  wuch  that  Prob 

(w  >>a) 

■  a 

Source:  Adapted  from  H.  W.  Lilliefors,  "On  the  Kolmogorov-Smimov  Test 
for  Normality  with  Mean  and  Variance  Unknown,"  Journal  of  the  American 
Statistical  Association,  62  (1967),  pp.  399-402. 
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